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CANCER ASSOCIATED PROTEIN PHOSPHATASES AND THEIR USES 



BACKGROUND OF THE INVENTION 
5 An accumulation of genetic changes underlies the development and progression of 

cancer, resulting in cells that differ from normal cells in their behavior, biochemistry, genetics, 
and microscopic appearance. Mutations in DNA that cause changes in the expression level 
of key proteins, or in the biological activity of proteins, are thought to be at the heart of 
cancer. For example, cancer can be triggered when genes that play a critical role in the 

10 regulation of cell division undergo mutations that lead to their over-expression. "Oncogenes" 
are involved in the dysregulation of growth that occurs in cancers. An aspect of oncogenesis 
that is often linked to tumor growth is angiogenesis. The growth of new blood vessels is 
essential for the later stages of solid tumor growth. Angiogenesis is caused by the migration 
and proliferation of the endothelial cells that form blood vessels. 

15 Oncogene activity may involve kinases and phosphatases, enzymes that help 

regulate many cellular activities, particularly signaling from the cell membrane to the nucleus 
to initiate the cell's entrance into the cell cycle and to control other functions. These signaling 
pathways may involve kinases and phosphatases of proteins, or kinases or phosphatases of 
phosphatidylinositol (PI) lipids. PI is unique among membrane lipids because it can undergo 

20 reversible phosphorylation at multiple sites to generate a variety of distinct inositol 
phospholipids which participate in many aspects in the development, in particular in 
promoting cell survival and growth. Thus many kinases and phosphatases that are involved 
in regulating the generation of inositol phospholipids are likely to participate in oncogenesis. 
Oncogenes may be tumor susceptibility genes, which are typically up-regulated in 

25 tumor cells, or may be tumor suppressor genes, which are down-regulated or absent in 
tumor cells. Malignancies can arise when a tumor suppressor is lost and/or an oncogene is 
inappropriately activated. When such mutations occur in somatic cells, they result in the 
growth of sporadic tumors. 

Hundreds of genes have been implicated in cancer, but in most cases relationships 

30 between these genes and their effects are poorly understood. Using massively parallel gene 
expression analysis, scientists can now begin to connect these genes into related pathways. 

Phosphorylation is important in signal transduction mediated by receptors via 
extracellular biological signals such as growth factors or hormones. For example, many 
oncogenes are kinases or phosphatases, i.e. enzymes that catalyze protein phosphorylation 

35 or dephosphorylation reactions or are specifically regulated by phosphorylation. In addition, 
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a kinase or phosphatase can have its activity regulated by one or more distinct kinase or 
phosphatases, resulting in specific signaling cascades. 

Cloning procedures aided by homology searches of expressed sequence tag (EST) 
databases have accelerated the pace of discovery of new genes, but EST database 
5 searching remains an involved and onerous task. More than 3.6 million human EST 
sequences have been deposited in public databases, making it difficult to identify ESTs that 
represent new genes. Compounding the problems of scale are difficulties in detection 
associated with a high sequencing error rate and low sequence similarity between distant 
homologues. 

10 Despite a long-felt need to understand and discover methods for regulating cells 

involved in various disease states, the complexity of signal transduction pathways has been 
a barrier to the development of products and processes for such regulation. Accordingly, 
there is a need in the art for improved methods for detecting and modulating the activity of 
such genes, and for treating diseases associated with the cancer and signal transduction 

15 pathways. 

RELEVANT LITERATURE 
The use of genomic sequence in data mining for signaling proteins is discussed in 
Schultz et al. Nature Genetics (2000) 25:201. Serine/threonine kinases and phosphatases 
20 have been reviewed, for example, by Cross TG et al. Exp Cell Res (2000) 256(1 ):34-41. PI 
signaling pathways are reviewed, for example, by Irvine in Curr. Opin. Cell Bio. (1992) 
4:212-219. 

SUMMARY OF THE INVENTION 

25 Several protein and phosphatidylinositol lipid phosphatases are herein shown to be 

over-expressed in hyper-proliferative cells. Detection of expression in hyper-proliferative 
cells is useful as a diagnostic; for determining the effectiveness and mechanism of action of 
therapeutic drug candidates, and for determining patient prognosis. These phosphatase 
sequences further provide a target for screening pharmaceutical agents effective in treating 

30 hyper-proliferative disorders. In a further embodiment, the present invention provides 
methods and compositions relating to agents, particularly antibodies that specifically bind to 
the phosphatase proteins, for treatment and visualization of hyper-proliferative disorders in 
patients. 

35 



2 




10 



WO 03/083102 PCT/CA03/00393 



DETAILED DESCRIPTION 
The MKPX, PTP4A1, PTPN7, FEM-2(formerly KIAA0015), DKFZP566K0524 and 
FLJ20313 phosphatases are shown to be over-expressed in cancer cells. The encoded 
polypeptides provide targets for drug screening or altering expression levels, and for 
determining other molecular targets in phosphatase signal transduction pathways involved in 
transformation and growth of tumoi cells. Detection of over-expression in cancers provides a 
useful diagnostic for predicting patient prognosis and probability of drug effectiveness. The 
present invention further provides methods and compositions relating to agents that 
specifically bind to these phosphatases, for treatment and visualization of tumors in patients. 



PHOSPHATASES 

The human cDNA sequences encoding MKPX, PTP4A1, PTPN7, FEM-2, 
DKFZP566K0524 and FLJ20313 are provided as SEQ ID NOS:1, 3, 5, 7, 9 and 11 
respectively and the encoded polypeptide product is provided as SEQ ID NOS:2, 4, 6, 8, 10 

15 and 1 2 respectively. Dot blot analysis of probes prepared from mRNA of tumors showed that 
expression of these genes are up-regulated In clinical samples of human tumors. 

MKPX phosphatase. Activated mitogen-activated protein (MAP) kinases play an 
essential role controlling many cell division functions. Dual specificity protein phosphatases 
elicit selective inactivation of MAP kinases and are under tight transcriptional control. MKPX 

20 phosphatase is dual-specific protein phosphatase. The open reading frame of MKPX predicts 
a protein of 184 amino acids related to the Vaccinia virus VH1 and human VH1-related 
(VHR) phosphatases. Expression VHR-related MKPX is highest in thymus, but also 
detectable in monocytes and lymphocytes. A MKPX-specific antiserum detects a protein 
with an apparent molecular mass of 19 kDa in many cells, including T lymphocytes and 

25 monocytes. MKPX expression was not induced by T cell activation, but decreased somewhat 
at later time points. In vitro, MKPX dephosphorylated the Erk2 mitogen-activated protein 
kinase with faster kinetics than did VHR, which is thought to be specific for Erk1 and 2. When 
expressed in Jurkat T cells, MKPX has the capacity to suppress T cell antigen receptor- 
induced activation of Erk2 and of an NFAT/AP-1 luciferase reporter, but not an NF-[kappa]B 

30 reporter. MKPX is a member of the VH1/VHR group of small dual-specific phosphatases that 
act in mitogen-activated protein kinase signaling pathways (Alonso ef a/. J Biol Chem (2002) 
277:5524-5528). 

PTP4A1, otherwise known as protein tyrosine phosphatase IVA member 1 or PRL-1 
is similar to the rat PRL-1 gene. Expression of the rat PRL-1 gene, which encodes a unique 
35 nuclear protein tyrosine phosphatase, is positively associated with cellular growth during liver 
development, regeneration, and oncogenesis but with differentiation in intestine and other 
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tissues. The human PRL-1 gene is localized to chromosome 6 within band q12. Human, rat, 
and mouse PRL-1 are 100% conserved at the amino acid level and 55% identical to a newly 
identified Caenorhabditis elegans PRL-1. Two promoter activities, P1 and P2, are present in 
the human PRL-1 gene. An enhancer that bound a developmentally regulated factor, PRL-1 

5 intron enhancer complex (PIEC), was localized to the first intron of the human PRL-1 gene. 
The presence- of PIEC correlates with the ability- of the intron enhancer to confer 
transcriptional activation in HepG2 and F9 cells. The intron enhancer contributes significantly 
to PRL-1 promoter activity in HepG2 cells which contain PIEC but not to NIH 3T3 cells which 
do not (Peng etal. (1998) J. Biol. Chem. 273 (27): 17286-17295) 

10 PTPN7. Protein tyrosine phosphatase, non-receptor type 7 (PTPN7) is involved in 

lymphocyte development and signal transduction. Tyrosine phosphorylation and 
dephosphorylating events have been shown to be central to the process of growth regulation 
and signal transduction. PTPN7 contains a tyrosine phosphatase domain and is expressed 
exclusively in thymus and spleen. A cDNA of 2760 bp encodes a 339-amino acid, 

15 intracellular, single-domain tyrosine phosphatase. When expressed as a glutathionine-S- 
transferase fusion protein, efficient lysis of p-nitrophenyl phosphate is noted, indicating in 
vitro enzymatic activity of the cloned gene product. Normal mouse lymphocytes increase 
mRNA expression 10-15-fold upon stimulation with phytohemagglutinin, concanavalin A, 
lipopolysaccharide or anti-CD3 monoclonal antibody. This hematopoietic tyrosine 

20 phosphatase may play a role in the regulation of T and B lymphocyte development and 
signal transduction (Zanke etal., Eur J Immunol (1992) 22:235-9). 

FEM-2. FEM-2, formerly known as KIAA0015, represents is thought to be a 
Ca2+/calmodulin-dependent protein kinase phosphatases that promote apoptosis (Tan et a/. 
J Biol Chem (2001) 276(47):441 93-202). In Caenorhabditis elegans, fem-1, fem-2, and fem-3 

25 play pivotal roles in sex determination. A mammalian homologue of the C. elegans sex- 
determining protein FEM-1, FIAalpha, has been described. Although there is little evidence 
to link FIAalpha to sex determination, FIAalpha and FEM-1 both promote apoptosis in 
mammalian cells. Human FEM2 (hFEM-2) is similar to C. elegans FEM-2 and exhibits PP2C 
phosphatase activity and associates with FEM-3. hFEM-2 shows striking similarity (79% 

30 amino acid identity) to rat Ca(2+)/calmodulin (CaM)-dependent protein kinase phosphatase 
(rCaMKPase). hFEM-2 and FEM-2, but not PP2Calpha, were demonstrated to 
dephosphorylate CaM kinase II efficiently in vitro, suggesting that hFEM-2 is a specific 
phosphatase for CaM kinase. Furthermore, hFEM-2 and FEM-2 associated with FIAalpha 
and FEM-1 respectively. Overexpression of hFEM-2, FEM-2, or rCaMKPase all mediated 

35 apoptosis in mammalian cells. The catalytically active, but not the inactive, forms of hFEM-2 
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induced caspase-dependent apoptosis, which was blocked by Bcl-XL or a dominant negative 
mutant of caspase-9. Human FEM-2 is likely to be a conserved CaM kinase phosphatases 
that plays a role in apoptosis signaling. 

DKFZP566K0524. The function of human sequence DKFZP566K0524 is not known. 
5 However it is related to the protein tyrosine phosphatase, non-receptor type 20 gene of mice 
(Ohsugi et al. (1997) J Biol .Chem 272:33092-9). This gene encodes a protejn-tyrosine 
phosphatase expressed exclusively in mice testis. The gene encodes an open reading frame 
of 426 amino acids containing a single catalytic domain in the carboxyl-terminal half. Indirect 
immunofluorescence studies and in situ hybridization analysis showed that this protein was 

10 specifically expressed in testicular germ cells that have undergone meiosis. 
Developmentally, the mouse protein is detected between 2 and 3 weeks after birth, in parallel 
with the onset of meiosis. The mouse protein is a member of the cytoplasmic protein-tyrosine 
phosphatases that may play an important role(s) in spermatogenesis and/or meiosis (Ohsugi 
etal. J Biol Chem (1997) 272:33092-9). 

15 FLJ20313. However, FLJ20313 shows similarity to the phosphatidylinositol-3 

phosphate 3-phosphatase adaptor subunit. D3-phosphoinositides act as second messengers 
by recruiting, and thereby activating, diverse signaling proteins. The rat phosphatidylinositol 
3-phosphate [Ptdlns(3)P] 3-phosphatase, comprising a heterodimer of a 78-kDa adapter 
subunit in complex with a 65-kDa catalytic subunit. The human 3-phosphatase adapter 

20 subunit (3-PAP) shares significant sequence similarity with the protein and lipid 3- 
phosphatase myotubularin, and with several other members of the myotubularin gene family 
including SET-binding factor 1. However, unlike myotubularin, 3-PAP does not contain a 
consensus HCX(5)R catalytic motif. The 3-PAP sequence contains several motifs that predict 
interaction with proteins containing Src homology-2 (SH2) domains, phosphotyrosine-binding 

25 (PTB) domains, members of the 14-3-3 family, as well as proteins with SET domains. 
Northern blot analysis identified two transcripts (5.5 kb and 2.5 kb) with highest abundance in 
human liver, kidney, lung, and placenta. 3-PAP immunoprecipitates isolated from platelet 
cytosol hydrolyzed the D3-phosphate from Ptdlns(3)P and Ptdlns 3,4-bisphosphate 
[Ptdlns(3,4)P(2)]. However, insect cell-expressed 3-PAP recombinant protein was 

30 catalytically inactive. The 3-PAP polypeptide may therefore be an adapter subunit 
(Nandurkar et al Proc Natl Acad Sci USA (2001) 98(17):9499-504). 

HYPER-PROLIFERATIVE DISORDERS OF INTEREST 
The subject genes are used to diagnose a hyper-proliferative disorder, or their 

35 
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activities manipulated to treat a hyperproliferative disorders, e.g. to inhibit tumor growth, to 
inhibit angiogenesis, to decrease inflammation associated with a lymphoproliferative 
disorder, to inhibit graft rejection, or neurological damage due to tissue repair, efc. There are 
many disorders associated with a dysregulation of cellular proliferation. The conditions of 

5 interest include, but are not limited to, the following conditions. 

The subject methods are applied to the treatment of--a variety of conditions where 
there is proliferation and/or migration of smooth muscle cells, and/or inflammatory cells into 
the intimal layer of a vessel, resulting in restricted blood flow through that vessel, i.e. 
neointimal occlusive lesions. Occlusive vascular conditions of interest include 

10 atherosclerosis, graft coronary vascular disease after transplantation, vein graft stenosis, 
peri-anastomatic prosthetic graft stenosis, restenosis after angioplasty or stent placement, 
and the like. 

Diseases where there is hyperproliferation and tissue remodeling or repair of 
reproductive tissue, e.g. uterine, testicular and ovarian carcinomas, endometriosis, 

15 squamous and glandular epithelial carcinomas of the cervix, etc. are reduced in cell number 
by administration of the subject compounds 

Tumor cells are characterized by uncontrolled growth, invasion to surrounding 
tissues, and metastatic spread to distant sites. Growth and expansion requires an ability not 
only to proliferate, but also to down-modulate cell death (apoptosis) and activate 

20 angiogenesis to produce a tumor neovasculature. Angiogenesis may be inhibited by 
affecting the cellular ability to interact with the extracellular environment and to migrate, 
which is an integrin-specific function, or by regulating apoptosis of the endothelial cells. 
Integrins function in cell-to-cell and cell-to-extracellular matrix (ECM) adhesive interactions 
and transduce signals from the ECM to the cell interior and vice versa. Since these 

25 properties implicate integrin involvement in cell migration, invasion, intra- and extra-vasation, 
and platelet interaction, a role for integrins in tumor growth and metastasis is obvious. 

Tumors of interest for treatment include carcinomas, e.g. colon, duodenal, prostate, 
ovarian, breast, melanoma, ductal, hepatic, pancreatic, renal, endometrial, stomach, 
dysplastic oral mucosa, polyposis, invasive oral cancer, non-small cell lung carcinoma, 

30 transitional and squamous cell urinary carcinoma efc; neurological malignancies, e.g. 
neuroblastoma, gliomas, efc; hematological malignancies, e.g. childhood acute leukaemia, 
non-Hodgkin's lymphomas, chronic lymphocytic leukaemia, malignant cutaneous T-cells, 
mycosis fungoides, non-MF cutaneous T-cell lymphoma, lymphomatoid papulosis, T-cell rich 
cutaneous lymphoid hyperplasia, bullous pemphigoid, discoid lupus erythematosus, lichen 

35 planus, efc; and the like. 
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Some cancers of particular interest include breast cancers, which are primarily 
adenocarcinoma subtypes. Ductal carcinoma in situ is the most common type of noninvasive 
breast cancer. In DCIS, the malignant cells have not metastasized through the walls of the 
ducts into the fatty tissue of the breast. Infiltrating (or invasive) ductal carcinoma (IDC) has 

5 metastasized through the wall of the duct and invaded the fatty tissue of the breast. 
Infiltrating (or invasive) lobular carcinoma (tLC) is similar to IDC, in that it has the potential 
metastasize elsewhere in the body. About 10% to 15% of invasive breast cancers are 
invasive lobular carcinomas. 

Also of interest is non-small cell lung carcinoma. Non-small cell lung cancer (NSCLC) 

10 is made up of three general subtypes of lung cancer. Epidermoid carcinoma (also called 
squamous cell carcinoma) usually starts in one of the larger bronchial tubes and grows 
relatively slowly. The size of these tumors can range from very small to quite large. 
Adenocarcinoma starts growing near the outside surface of the lung and may vary in both 
size and growth rate. Some slowly growing adenocarcinomas are described as alveolar cell 

15 cancer. Large cell carcinoma starts near the surface of the lung, grows rapidly, and the 
growth is usually fairly large when diagnosed. Other less common forms of lung cancer are 
carcinoid, cylindroma, mucoepidermoid, and malignant mesothelioma. 

Melanoma is a malignant tumor of melanocytes. Although most melanomas arise in 
the skin, they also may arise from mucosal surfaces or at other sites to which neural crest 

20 cells migrate. Melanoma occurs predominantly in adults, and more than half of the cases 
arise in apparently normal areas of the skin. Prognosis is affected by clinical and histological 
factors and by anatomic location of the lesion. Thickness and/or level of invasion of the 
melanoma, mitotic index, tumor infiltrating lymphocytes, and ulceration or bleeding at the 
primary site affect the prognosis. Clinical staging is based on whether the tumor has spread 

25 to regional lymph nodes or distant sites. For disease clinically confined to the primary site, 
the greater the thickness and depth of local invasion of the melanoma, the higher the chance 
of lymph node metastases and the worse the prognosis. Melanoma can spread by local 
extension (through lymphatics) and/or by hematogenous routes to distant sites. Any organ 
may be involved by metastases, but lungs and liver are common sites. 

30 Other hyperproliferative diseases of interest relate to epidermal hyperproliferation, 

tissue remodeling and repair. For example, the chronic skin inflammation of psoriasis is 
associated with hyperplastic epidermal keratinocytes as well as infiltrating mononuclear cells, 
including CD4+ memory T cells, neutrophils and macrophages. 

The proliferation of immune cells is associated with a number of autoimmune and 

35 lymphoproliferative disorders. Diseases of interest include multiple sclerosis, rheumatoid 
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arthritis and insulin dependent diabetes mellitus. Evidence suggests that abnormalities in 
apoptosis play a part in the pathogenesis of systemic lupus erythematosus (SLE). Other 
lymphoproliferative conditions the inherited disorder of lymphocyte apoptosis, which is an 
autoimmune lymphoproliferative syndrome, as well as a number of leukemias and 

5 lymphomas- Symptoms of allergies to environmental and food agents, as well as 
— inflammatory bowel disease, may also be alleviated by the compounds of the invention, _ ~ 
Conditions treatable by inhibiting a molecule of the invention also include those 
associated with defects in cell cycle regulation or in response to extracellular signals, e.g. 
hyperglycemia and diabetes Type I and Type II, immunological disorders, e.g. autoimmune 

10 and immunodeficiency diseases; hyperproliferative disorders, which may include psoriasis, 
arthritis, inflammation, angiogenesis, endometriosis, scarring, cancer, etc. 

DIAGNOSTIC APPLICATIONS 
Determination of the presence of MKPX, PTP4A1, PTPN7, FEM-2, DKFZP566K0524 

15 or FLJ20313 is used in the diagnosis, typing and staging of tumors. Detection of the 
presence of these phosphatases is performed by the use of a specific binding pair member 
to quantitate the specific protein, DNA or RNA present in a patient sample. Generally the 
sample will be a biopsy or other cell sample from the tumor. Where the tumor has 
metastasized, blood samples may be analyzed. MKPX, PTP4A1, PTPN7, FEM-2, 

20 DKFZP566K0524 or FLJ20313 can be used in screening methods to identify candidate 
therapeutic agents and other therapeutic targets. Methods providing agents that bind to 
these proteins are provided as cancer treatments and for cancer imaging. 

In a typical assay, a tissue sample, e.g. biopsy, blood sample, efc. is assayed for the 
presence of MKPX, PTP4A1, PTPN7, FEM-2, DKFZP566K0524 or FLJ20313 specific 

25 sequences by combining the sample with a specific binding member, and detecting directly 
or indirectly the presence of the complex formed between the two members. The term 
"specific binding member" as. used herein refers to a member of a specific binding pair, i.e. 
two molecules where one of the molecules through chemical or physical means specifically 
binds to the other molecule. One of the molecules will be a nucleic acid e.g. corresponding 

30 to SEQ ID NOS:1, 3, 5, 7, 9 or 11, or a polypeptide encoded by the nucleic acid, which can 
include any protein substantially similar to the proteins or a fragment thereof; or any nucleic 
acid substantially similar to the nucleotide sequence provided in SEQ ID NOS:1, 3, 5, 7, 9 or 
11 or a fragment thereof. The complementary members of a specific binding pair are 
sometimes referred to as a ligand and receptor. 

35 Binding pairs of interest include antigen and antibody specific binding pairs, peptide- 

MHC antigen and T-cell receptor pairs; complementary nucleotide sequences (including 
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nucleic acid sequences used as probes and capture agents in DNA hybridization assays); 
phosphatase protein and substrate pairs; autologous monoclonal antibodies, and the like. 
The specific binding pairs may include analogs, derivatives and fragments of the original 
specific binding member. For example, an antibody directed to a protein antigen may also 

5 recognize peptide fragments, chemically synthesized peptidomimetics, labeled protein, 
derivatized protein, etc. so long as an epitope is present ■_ - -.. 

Nucleic acid sequences. Nucleic acids encoding MKPX, PTP4A1, PTPN7, FEM-2, 
DKFZP566K0524 or FLJ20313 are useful in the methods of the invention, e.g. as a specific 
binding member, to produce the encoded polypeptide, etc. The nucleic acids of the invention 

10 also include nucleic acids having a high degree of sequence similarity or sequence identity to 
SEQ ID NOS:1, 3, 5, 7, 9 or 11. Sequence identity can be determined by hybridization under 
stringent conditions, for example, at 50°C or higher and 0.1XSSC (9 mM saline/0.9 mM 
sodium citrate). Hybridization methods and conditions are well known in the art, see, e.g., 
U.S. patent 5,707,829. Nucleic acids that are substantially identical to the provided nucleic 

15 acid sequence, e.g. allelic variants, genetically altered versions of the gene, etc., bind to 
SEQ ID NOS:1, 3, 5, 7, 9 or 11 under stringent hybridization conditions. 

The nucleic acids can be cDNAs or genomic DNAs, as well as fragments thereof. 
The term "cDNA" as used herein is intended to include all nucleic acids that share the 
arrangement of sequence elements found in native mature mRNA species, where sequence 

20 elements are exons and 3* and 5' non-coding regions. Normally mRNA species have 
contiguous exons, with the intervening introns, when present, being removed by nuclear 
RNA splicing, to create a continuous open reading frame encoding a polypeptide of the 
invention. 

A genomic sequence of interest comprises the nucleic acid present between the 
25 initiation codon and the stop codon, as defined in the listed sequences, including all of the 
introns that are normally present in a native chromosome. It can further include the 3* and 5' 
untranslated regions found in the mature mRNA. It can further include specific transcriptional 
and translational regulatory sequences, such as promoters, enhancers, efc, including about 
1 kb, but possibly more, of flanking genomic DNA at either the 5' or 3' end of the transcribed 
30 region. The genomic DNA flanking the coding region, either 3" or 5', or internal regulatory 
sequences as sometimes found in introns, contains sequences required for proper tissue, 
stage-specific, or disease-state specific expression, and are useful for investigating the up- 
regulation of expression in tumor cells. 

Probes specific to the nucleic acid of the invention can be generated using an nucleic 
35. acid sequence, e.g. as disclosed in SEQ ID NOS:1, 3, 5, 7, 9 or 11. The probes are 
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preferably at least about 1 8 nt f 25 nt f 50 nt or more of the corresponding contiguous, and are 
usually less than about 2, 1, or 0.5 kb in length. Preferably, probes are designed based on a 
contiguous sequence that remains unmasked following application of a masking program for 
masking low complexity, e.g. BLASTX. Double or single stranded fragments can be obtained 

5 from the DNA sequence by chemically synthesizing oligonucleotides in accordance with 
conventional methods, by restriction enzyme-digestion, by PGR amplification, efc. The 
probes can be labeled, for example, with a radioactive, biotinylated, or fluorescent tag. 

The nucleic acids of the subject invention are isolated and obtained in substantial 
purity, generally as other than an intact chromosome. Usually, the nucleic acids, either as 

10 DNA or RNA, will be obtained substantially free of other naturally-occurring nucleic acid 
sequences, generally being at least about 50%, usually at least about 90% pure and are 
typically "recombinant," e.g., flanked by one or more nucleotides with which it is not normally 
associated on a naturally occurring chromosome. 

The nucleic acids of the invention can be provided as a linear molecule or within a 

15 circular molecule, and can be provided within autonomously replicating molecules (vectors) 
or within molecules without replication sequences. Expression of the nucleic acids can be 
regulated by their own or by other regulatory sequences known in the art. The nucleic acids 
of the invention can be introduced into suitable host cells using a variety of techniques 
available in the art, such as transferrin polycation-mediated DNA transfer, transfection with 

20 naked or encapsulated nucleic acids, liposome-mediated DNA transfer, intracellular 
transportation of DNA-coated latex beads, protoplast fusion, viral infection, electroporation, 
gene gun, calcium phosphate-mediated transfection, and the like. 

For use in amplification reactions, such as PCR, a pair of primers will be used. The 
exact composition of the primer sequences is not critical to the invention, but for most 

25 applications the primers will hybridize to the subject sequence under stringent conditions, as 
known in the art. It is preferable to choose a pair of primers that will generate an 
amplification product of at least about 50 nt, preferably at least about 100 nt. Algorithms for 
the selection of primer sequences are generally known, and are available in commercial 
software packages. Amplification primers hybridize to complementary strands of DNA, and 

30 will prime towards each other. For hybridization probes, it may be desirable to use nucleic 
acid analogs, in order to improve the stability and binding affinity. The term "nucleic acid" 
shall be understood to encompass such analogs. 

Polypeptide Compositions. The present invention further provides polypeptides 
encoded by SEQ ID NOS:1, 3, 5, 7, 9 and 11 and variants thereof, which can be used for a 

35 variety of purposes. The polypeptides contemplated by the invention include those encoded 
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by the disclosed nucleic acids, as well as nucleic acids that, by virtue of the degeneracy of 
the genetic code, are not identical in sequence to the disclosed nucleic acids, and vairants 
thereof. 

In general, the term "polypeptide" as used herein refers to both the full length 
polypeptide encoded by the recited nucleic acid, the polypeptide encoded by the gene 
represented by the recited~Tiucleic acid, as well as portions or fragments thereof. 
"Polypeptides" also includes variants of the naturally occurring proteins, where such variants 
are homologous or substantially similar to the naturally occurring protein, and can be of an 
origin of the same or different species as the naturally occurring protein (e.g., human, 
murine, or some other species that naturally expresses the recited polypeptide, usually a 
mammalian species). In general, variant polypeptides have a sequence that has at least 
about 80%, usually at least about 90%, and more usually at least about 98% sequence 
identity with a differentially expressed polypeptide described herein, as measured by BLAST 
2.0 using the parameters described above. The variant polypeptides can be naturally or non- 
naturally glycosylated, i.e., the polypeptide has a glycosylation pattern that differs from the 
glycosylation pattern found in the corresponding naturally occurring protein. 

In general, the polypeptides of the subject invention are provided in a non-naturally 
occurring environment, e.g. are separated from their naturally occurring environment. In 
certain embodiments, the subject protein is present in a composition that is enriched for the 
protein as compared to a control. As such, purified polypeptides are provided, where by 
purified is meant that the protein is present in a composition that is substantially free of non- 
differentially expressed polypeptides, where by substantially free is meant that less than 
90%, usually less than 60% and more usually less than 50% of the composition is made up 
of non-MKPX, PTP4A1, PTPN7, FEM-2, DKFZP566K0524 or FLJ20313 polypeptides. 

Variant polypeptides can include amino acid substitutions, additions or deletions. The 
amino acid substitutions can be conservative amino acid substitutions or substitutions to 
eliminate non-essential amino acids, such as to alter a glycosylation site, a phosphorylation 
site or an acetylation site, or to minimize misfolding by substitution or deletion of one or more 
cysteine residues that are not necessary for function. Conservative amino acid substitutions 
are those that preserve the general charge, hydrophobicity/hydrophilicity, and/or steric bulk 
of the amino acid substituted. Variants can be designed so as to retain or have enhanced 
biological activity of a particular region of the protein (e.g., a functional domain and/or, where 
the polypeptide is a member of a protein family, a region associated with a consensus 
sequence). 
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Variants also include fragments of the polypeptides disclosed herein, particularly 
biologically active fragments and/or fragments corresponding to functional domains. 
Fragments of interest will typically be at least about 10 aa to at least about 15 aa in length, 
usually at least about 50 aa in length, and can be as long as 300 aa in length or longer, but 
will usually not exceed about 500 aa in length, where the fragment will have a contiguous 
stretcirof amino acids that is identical to a polypeptide encoded by SEQ ID NOS:1, 3, 5, 7, .9 
or 1 1 , or a homolog thereof. 

Antibodies. As used herein, the term "antibodies" includes antibodies of any isotype, 
fragments of antibodies which retain specific binding to antigen, including, but not limited to. 
Fab, Fv, scFv, and Fd fragments, chimeric antibodies, humanized antibodies, single-chain 
antibodies, and fusion proteins comprising an antigen-binding portion of an antibody and a 
non-antibody protein. The antibodies may be detectably labeled, e.g., with a radioisotope, an 
enzyme which generates a detectable product, a green fluorescent protein, and the like. The 
antibodies may be further conjugated to other moieties, such as members of specific binding 
pairs, e.g., biotin (member of biotin-avidin specific binding pair), and the like. The antibodies 
may also be bound to a solid support, including, but not limited to, polystyrene plates or 
beads, and the like. 

"Antibody specificity", in the context of antibody-antigen interactions, is a term well 
understood in the art, and indicates that a given antibody binds to a given antigen, wherein 
the binding can be inhibited by that antigen or an epitope thereof which is recognized by the 
antibody, and does not substantially bind to unrelated antigens. Methods of determining 
specific antibody binding are well known to those skilled in the art, and can be used to 
determine the specificity of antibodies of the invention for a polypeptide, particularly MKPX, 
PTP4A1, PTPN7, FEM-2, DKFZP566K0524 or FLJ20313 . 

As used herein, a compound which specifically binds to human protein MKPX, 
PTP4A1, PTPN7, FEM-2, DKFZP566K0524 or FLJ20313 is any compound (such as an 
antibody) which has a binding affinity for any naturally occurring isoform, splice variant, or 
polymorphism. As one of ordinary skill in the art will appreciate, such "specific" binding 
compounds (e.g., antibodies) may also bind to other closely related proteins which exhibit 
significant homology, for example, having greater than 90% identity, more preferably greater 
than 95% identity, and most preferably greater than 99% identity with the amino acid 
sequence of SEQ ID NOS:2, 4, 6. 8, 10 or 12. Such proteins may include truncated forms or 
domains of SEQ ID NOS:2, 4, 6, 8, 10 or 12, and recombinantly engineered alterations of 
SEQ ID NOS:2, 4, 6, 8, 10 or 12. For example, a portion of SEQ ID NOS:2, 4, 6, 8, 10 or 12 
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may be engineered to encode a non-naturally occurring cysteine for cross-linking to an 
immunoconjugate protein, as described below. 

Selection of antibodies which alter (enhance or inhibit) the binding of a compound to 
MKPX, PTP4A1, PTPN7, FEM-2, DKFZP566K0524 or FLJ20313 may be accomplished by a 
straightforward binding inhibition/enhancement assay. According to standard techniques, the 
binding of a labeled (e.g., fluorescently or enzyme-labeled) antibody to. a. protein of the 
invention, which has been immobilized in a microtiter well, is assayed using standard 
phosphatase assays in both the presence and absence of the ligand. The change in binding 
is indicative of either an enhancer (increased binding) or competitive inhibitor (decreased 
binding) relationship between the antibody and the ligand. Such assays may be carried out in 
high-throughput formats (e.g., 384 well plate formats, in robotic systems) for the automated 
selection of monoclonal antibody candidates for use as ligand or substrate-binding inhibitors 
or enhancers. 

In addition, antibodies that are useful for altering the function of a protein of the 
invention may be assayed in functional formats. In cell-based assays of activity, expression 
of a protein of the invention is first verified in the particular cell strain to be used. If 
necessary, the cell line may be stably transfected with a coding sequence under the control 
of an appropriate constituent promoter, in order to express a protein of the invention at a 
level comparable to that found in primary tumors. The ability of the tumor cells to survive in 
the presence of the candidate function-altering -antibody is then determined. Similarly, in 
vivo models for human cancer, particularly colon, pancreas, lung and ovarian cancer are 
available as nude mice/SCID mice or rats, have been described. Once expression of a 
protein of the invention in the tumor model is verified, the effect of the candidate antibodies 
on the tumor masses in these models can evaluated, wherein the ability of the antibody 
candidates to alter phosphatase activity is indicated by a decrease in tumor growth or a 
reduction in the tumor mass. Thus, antibodies that exhibit the appropriate anti-tumor effect 
may be selected without direct knowledge of a binding ligand. 

Generally, as the term is utilized in the specification, "antibody" or "antibody moiety" is 
intended to include any polypeptide chain-containing molecular structure that has a specific 
shape which fits to and recognizes an epitope, where one or more non-covalent binding 
interactions stabilize the complex between the molecular structure and the epitope. 
Antibodies which bind specifically to a protein of the invention are referred to as anti- 
phosphatase antibodies. The specific or selective fit of a given structure and its specific 
epitope is sometimes referred to as a "lock and key" fit. The archetypal antibody molecule is 
the immunoglobulin, and all types of immunoglobulins (IgG, IgM, IgA, IgE, IgD, etc.), from all 
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sources (e.g., human, rodent, rabbit, cow, sheep, pig, dog, other mammal, chicken, turkey, 
emu, other avians, etc.) are considered to be "antibodies." Antibodies utilized in the present 
invention may be polyclonal antibodies, although monoclonal antibodies are preferred 
because they may be reproduced by cell culture or recombinant^, and may be modified to 
5 reduce their antigenicity. 

Polyclonal antibodies may ber raised by a standard protocol by injecting a production 
animal with an antigenic composition, formulated as described above. See, e.g., Harlow and 
Lane, Antibodies: A Laboratory Manual . Cold Spring Harbor Laboratory, 1988. In one such 
technique, an antigenic portion of a MKPX, PTP4A1, PTPN7, FEM-2, DKFZP566K0524 or 

10 FLJ20313 polypeptide is initially injected into any of a wide variety of mammals (e.g., mice, 
rats, rabbits, sheep or goats). Alternatively, in order to generate antibodies to relatively short 
peptide portions of MKPX, PTP4A1, PTPN7, FEM-2, DKFZP566K0524 or FLJ20313 , a 
superior immune response may be elicited if the polypeptide is joined to an immunogenic 
carrier, such as ovalbumin, BSA, KLH, pre-S HBsAg, other viral or eukaryotic proteins, and 

15 the like. The peptide-conjugate is injected into the animal host, preferably according to a 
predetermined schedule incorporating one or more booster immunizations, and the animals 
are bled periodically. Polyclonal antibodies specific for the polypeptide may then be purified 
from such anti-sera by, for example, affinity chromatography using the polypeptide coupled 
to a suitable solid support. 

20 Alternatively, for monoclonal antibodies, hybridomas may be formed by isolating the 

stimulated immune cells, such as those from the spleen of the inoculated animal. These 
cells are then fused to immortalized cells, such as myeloma cells or transformed cells, which 
are capable of replicating indefinitely in cell culture, thereby producing an immortal, 
immunoglobulin-secreting cell line. The immortal cell line utilized is preferably selected to be 

25 deficient in enzymes necessary for the utilization of certain nutrients. Many such cell lines 
(such as myelomas) are known to those skilled in the art, and include, for example: thymidine 
phosphatase (TK) or hypoxanthine-guanine phosphoriboxyl transferase (HGPRT). These 
deficiencies allow selection for fused cells according to their ability to grow on, for example, 
hypoxanthine aminopterinthymidine medium (HAT). 

30 Preferably, the immortal fusion partners utilized are derived from a line that does not 

secrete immunoglobulin. The resulting fused cells, or hybridomas, are cultured under 
conditions that allow for the survival of fused, but not unfused, cells and the resulting 
colonies screened for the production of the desired monoclonal antibodies. Colonies 
producing such antibodies are cloned, expanded, and grown so as to produce large 
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quantities of antibody, see Kohler and Milstein, Nature (1975) 256:495 (the disclosure of 
which is herein incorporated by reference). 

Large quantities of monoclonal antibodies from the secreting hybridomas may then 
be produced by injecting the clones into the peritoneal cavity of mice and harvesting the 
ascites fluid therefrom. The mice, preferably primed with pristine, or some other tumor- 
promoter, and-imraunosuppressed chemically or by irradiation, may be any of various 
suitable strains known to those in the art. The ascites fluid is harvested from the mice and 
the monoclonal antibody purified therefrom, for example, by CM Sepharose column 
chromatography or other chromatographic means. Alternatively, the hybridomas may be 
cultured in vitro or as suspension cultures. Batch, continuous culture, or other suitable 
culture processes may be utilized. Monoclonal antibodies are then recovered from the 
culture medium or supernatant. It is preferred that such antibodies by humanized or 
chimerized according to one of the procedures outlined below. 

In addition, the antibodies or antigen binding fragments may be produced by genetic 
engineering. In this technique, as with the standard hybridoma procedure, antibody- 
producing cells are sensitized to the desired antigen or immunogen. The messenger RNA 
isolated from the immune spleen cells or hybridomas is used as a template to make cDNA 
using PCR amplification. A library of vectors, each containing one heavy chain gene and 
one light chain gene retaining the Initial antigen specificity, is produced by insertion of 
appropriate sections of the amplified immunoglobulin cDNA into the expression vectors. A 
combinatorial library is constructed by combining the heavy chain gene library with the light 
chain gene library. This results in a library of clones which co-express a heavy and light 
chain (resembling the Fab fragment or antigen binding fragment of an antibody molecule). 
The vectors that carry these genes are co-transfected into a host (e.g. bacteria, insect cells, 
mammalian cells, or other suitable protein production host cell.). When antibody gene 
synthesis is induced in the transfected host, the heavy and light chain proteins self-assemble 
to produce active antibodies that can be detected by screening with the antigen or 
immunogen. 

Preferably, recombinant antibodies are produced in a recombinant protein production 
system which correctly glycosylates and processes the immunoglobulin chains, such as 
insect or mammalian cells, as is known in the art. 

Antibodies that have a reduced propensity to induce a violent or detrimental immune 
response in humans (such as anaphylactic shock), and which also exhibit a reduced 
propensity for priming an immune response which would prevent repeated dosage with the 
antibody therapeutic or imaging agent (e.g., the human-anti-murine-antibody "HAMA" 
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response), are preferred for use in the invention. Although some increased immune 
response against the tumor is desirable, the concurrent binding and inactivation of the 
therapeutic or imaging agent generally outweighs this benefit Thus, humanized, chimeric, or 
xenogenic human antibodies, which produce less of an immune response when administered 
to humans, are preferred for use in the present invention. 

^ ~ Chimeric antibodies may be made by recombinant means by combining, .the murine 
variable light and heavy chain regions (VK and VH), obtained from a murine (or other animal- 
derived) hybridoma clone, with the human constant light and heavy chain regions, in order to 
produce an antibody with predominantly human domains. The production of such chimeric 
antibodies is well known in the art, and may be achieved by standard means (as described, 
e.g., in U.S. Patent No. 5,624,659, incorporated fully herein by reference.) Humanized 
antibodies are engineered to contain even more human-like immunoglobulin domains, and 
incorporate only the complementarity-determining regions of the animal-derived antibody. 
This is accomplished by carefully examining the sequence of the hyper-variable loops of the 
variable regions of the monoclonal antibody, and fitting them to the structure of the human 
antibody chains. Although facially complex, the process is straightforward in practice. See, 
e.g., U.S. Patent No. 6,187,287, incorporated fully herein by reference. 

Alternatively, polyclonal or monoclonal antibodies may be produced from animals 
which have been genetically altered to produce human immunoglobulins, such as the 
Abgenix XenoMouse™ or the Medarex HuMAb ® technology. The transgenic animal may 
be produced by initially producing a "knock-out" animal which does not produce the animal's 
natural antibodies, and stably transforming the animal with a human antibody locus (e.g., by 
the use of a human artificial chromosome.) Only human antibodies are then made by the 
animal. Techniques for generating such animals, and deriving antibodies therefrom, are 
described in U.S. Patents No. 6,162,963 and 6,150,584. incorporated fully herein by 
reference. 

Alternatively, single chain antibodies (Fv, as described below) can be produced from 
phage libraries containing human variable regions (described in e.g. U.S. Patent No. 
6,174,708, incorporated fully herein by reference). 

In addition to entire immunoglobulins (or their recombinant counterparts), 
immunoglobulin fragments comprising the epitope binding site (e.g., Fab', F(ab') 2 , or other 
fragments) are useful as antibody moieties in the present invention. Such antibody 
fragments may be generated from whole immunoglobulins by ficin, pepsin, papain, or other 
protease cleavage. "Fragment,'' or minimal immunoglobulins may be designed utilizing 
recombinant immunoglobulin techniques. For instance "FV immunoglobulins for use in the 
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present invention may be produced by linking a variable light chain region to a variable heavy 
chain region via a peptide linker (e.g., poly-glycine or another sequence which does not form 

an alpha helix or beta sheet motif). 

Fv fragments are heterodimers of the variable heavy chain domain (V H ) and the 
variable light chain domain (V L ). The heterodimers of heavy and light chain domains that 
occur in whole IgG, for example, are connected by a disulfide bond-. ^Recombinant Fvs in 
which V H and V L are connected by a peptide linker are typically stable, see, for example, 
Huston et a/., Proc Natl Acad Sci USA (1988) 85:5879-5883 and Bird et al.. Science (1988) 
242:423-426, both fully incorporated herein, by reference. These are single chain Fvs which 
have been found to retain specificity and affinity and have been shown to be useful for 
imaging tumors and to make recombinant immunotoxins for tumor therapy. However, 
researchers have found that some of the single chain Fvs have a reduced affinity for antigen 
and the peptide linker can interfere with binding. Improved Fv's have also been made which 
comprise stabilizing disulfide bonds between the V H and V L regions, as described in U.S. 
Patent No. 6,147,203, incorporated fully herein by reference. Any of these minimal 
antibodies may be utilized in the present invention, and those which are humanized to avoid 
HAMA reactions are preferred for use in embodiments of the invention. 

In addition, derivatized immunoglobulins with added chemical linkers, detectable 
moieties (fluorescent dyes, enzymes, substrates, chemiluminescent moieties), or specific 
binding moieties (such as streptavidin, avidin, or biotin) may be utilized in the methods and 
compositions of the present invention. For convenience, the term "antibody" or "antibody 
moiety" will be used throughout to generally refer to molecules which specifically bind to a 
MKPX, PTP4A1, PTPN7, FEM-2, DKFZP566K0524 or FLJ20313 epitope, although the term 
will encompass all immunoglobulins, derivatives, fragments, recombinant or engineered 
immunoglobulins, and modified immunoglobulins, as described above. 

Candidate anti-phosphatase antibodies can be tested for activity by any suitable 
standard means. As a first screen, the antibodies may be tested for binding against the 
antigen utilized to produce them, or against the entire extracellular domain or protein. As a 
second screen, candidates may be tested for binding to an appropriate cell line, or to primary 
tumor tissue samples. For these screens, the candidate antibody may be labeled for 
detection (e.g., with fluorescein or another fluorescent moiety, or with an enzyme such as 
horseradish peroxidase). After selective binding is established, the candidate antibody, or an 
antibody conjugate produced as described below, may be tested for appropriate activity {i.e., 
the ability to decrease tumor cell growth and/or to aid in visualizing tumor cells) in an in vivo 
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model, such as an appropriate cell line, or in a mouse or rat or mouse tumor model, as 
described above. 

QUANTITATION OF NUCLEIC ACIDS 
MKPX, PTP4A1, PTPN7, FEM-2, DKFZP566K0524 or FLJ20313 nucleic acid 
reagents are used to screen patient samples.. e.g. biopsy-derived tumors, inflammatory 
samples such as arthritic synovium, ete., for amplified DNA in the cell, or increased 
expression of the corresponding mRNA or protein. DNA-based reagents are also designed 
for evaluation of chromosomal loci implicated in certain diseases e.g. for use in loss-of- 
heterozygosity (LOH) studies, or design of primers based on coding sequences. 

The polynucleotides of the invention can be used to detect differences in expression 
levels between two cells, e.g., as a method to identify abnormal or diseased tissue in a 
human. The tissue suspected of being abnormal or diseased can be derived from a different 
tissue type of the human, but preferably it is derived from the same tissue type; for example, 
an intestinal polyp or other abnormal growth should be compared with normal intestinal 
tissue. The normal tissue can be the same tissue as that of the test sample, or any normal 
tissue of the patient, especially those that express the polynucleotide-related gene of interest 
(e.g., brain, thymus, testis, heart, prostate, placenta, spleen, small intestine, skeletal muscle, 
pancreas, and the mucosal lining of the colon, efc). A difference between the 
polynucleotide-related gene. mRNA. or protein in the two tissues which are compared, for 
example, in molecular weight, amino acid or nucleotide sequence, or relative abundance, 
indicates a change in the gene, or a gene which regulates it. in the tissue of the human that 
was suspected of being diseased. 

The subject nucleic acid and/or polypeptide compositions may be used to analyze a 
patient sample for the presence of polymorphisms associated with a disease state. 
Biochemical studies may be performed to determine whether a sequence polymorphism in a 
coding region or control region is associated with disease, particularly cancers and other 
growth abnormalities. Diseases of interest may also include other hyperproliferative 
disorders. Disease associated polymorphisms may include deletion or truncation of the 
gene, mutations that alter expression level, that affect the binding activity of the protein, the 
phosphatase activity domain, efc. 

Changes in the promoter or enhancer sequence that may affect expression levels can 
be compared to expression levels of the normal allele by various methods known in the art. 
Methods for determining promoter or enhancer strength include quantitation of the expressed 
natural protein; insertion of the variant control element into a vector with a reporter gene such 
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as beta-galactosidase, luciferase, chloramphenicol acetyltransferase, etc. that provides for 
convenient quantitation; and the like. 

A number of methods are available for analyzing nucleic acids for the presence of a 
specific sequence, e.g. upregulated expression. Cells that express MKPX, PTP4A1 . PTPN7, 
5 FEM-2, DKFZP566K0524 or FLJ20313 may be used as a source of mRNA, which may be 
assayed directly or reversed anscribed into cDNA for analysis. The nucleic acid may be 
amplified by conventional techniques, such as the polymerase chain reaction (PCR), to 
provide sufficient amounts for analysis. The use of the polymerase chain reaction is 
described in Saiki era/. Science (1985) 239:487, and a review of techniques may be found in 
10 Sambrook et al. Molecular Cloni ng- A I ahoratorv Manual. CSH Press 1989, pp.14.2-14.33. 

A detectable label may be included in an amplification reaction. Suitable labels 
include fluorochromes, e.g. fluorescein isothiocyanate (FITC), rhodamine, Texas Red, 
phycoerythrin, allophycocyanin,6-carboxyfluorescein(6-FAM),2,7-dimethoxy-4,5-dichloro-6- 
carboxyfluorescein (JOE), 6-carboxy-X-rhodamine (ROX), 6-carboxy-2,4,7,4,7- 
15 hexachlorofluorescein (HEX), 5-carboxyfluorescein (5-FAM) or N,N,N,N-tetramethy!-6- 
carboxyrhodamine (TAMRA). radioactive labels, e.g. 32 P, ^S, 3 H; etc. The label may be a two 
stage system, where the amplified DNA is conjugated to biotin, haptens, etc. having a high 
affinity binding partner, e.g. avidin, specific antibodies, etc., where the binding partner is 
conjugated to a detectable label. The label may be conjugated to one or both of the primers. 
20 Alternatively, the pool of nucleotides used in the amplification is labeled, so as to incorporate 
the label into the amplification product. 

The sample nucleic acid, e.g. amplified or cloned fragment, is analyzed by one of a 
number of methods known in the art. Probes may be hybridized to Northern or dot blots, or 
liquid hybridization reactions performed. The nucleic acid may be sequenced by dideoxy or 
25 other methods, and the sequence of bases compared to a wild-type sequence. Single strand 
conformational polymorphism (SSCP) analysis, denaturing gradient gel electrophoresis 
(DGGE). and heteroduplex analysis in gel matrices are used to detect conformational 
changes created by DNA sequence variation as alterations in electrophoretic mobility. 
Fractionation is performed by gel or capillary electrophoresis, particularly acrylamide or 
30 agarose gels. 

Arrays provide a high throughput technique that can assay a large number of 
polynucleotides in a sample. In one aspect of the invention, an array is constructed 
comprising MKPX. PTP4A1. PTPN7, FEM-2, DKFZP566K0524 or FLJ20313 in conjunction 
with other cancer associated sequences, particularly cancer associated phosphatases. This 
35 technology can be used as a tool to test for differential expression. 
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A variety of methods of producing arrays, as well as variations of these methods, are 
known in the art and contemplated for use in the invention. For example, arrays can be 
created by spotting polynucleotide probes onto a substrate (e.g., glass, nitrocellulose, efc.) in 
a two-dimensional matrix or array having bound probes. The probes can be bound to the 
substrate by either covalent bonds or by non-specific interactions, such as hydrophobic 
interactions. Samples of nucleic acids can be detectably labeled (e.g., using, radioactive or 
fluorescent labels) and then hybridized to the probes. Double stranded nucleic acids, 
comprising the labeled sample polynucleotides bound to probe nucleic acids, can be 
detected once the unbound portion of the sample is washed away. Alternatively, the nucleic 
acids of the test sample can be immobilized on the array, and the probes detectably labeled. 

Techniques for constructing arrays and methods of using these arrays are described 
in. for example. Schena ef a/., Proc Natl Acad Sci U S A (1996) 93(20):10614-9; Schena ef 
a/., Science (1995) 270(5235):467-70; Shalon etal., Genome Res (1996) 6(7):639-45, United 
States Patent Nos. 5,807,522; 5,593,839; 5,578,832; 5,631,734; 5,599,695; and 5,556,752; 
EP 799 897; WO 97/29212; WO 97/27317; EP 785 280; WO 97/02357; EP 728 520; EP 
721 016; and WO 95/22058. 

Arrays can be used to, for example, examine differential expression of genes and can 
be used to determine gene function. For example, arrays can be used to detect differential 
expression of SEQ ID NOS:1, 3, 5, 7, 9 or 1 1, where expression is compared between a test 
cell and control cell (e.g., cancer cells and normal cells). High expression of a particular 
message in a cancer cell, which is not observed in a corresponding normal cell, indicates a 
cancer specific gene product. Exemplary uses of arrays are further described in, for 
example, Pappalarado ef a/., Sem Radiation Oncol (1998) 8:217; and Ramsay, Nature 
Biotechnol (1998) 16:40. Furthermore, many variations on methods of detection using arrays 
are well within the skill in the art and within the scope of the present invention. For example, 
rather than immobilizing the probe to a solid support, the test sample can be immobilized on 
a solid support that is then contacted with the probe. 

Polypeptide Analysis 
Screening for expression of the subject sequences may be based on the functional or 
antigenic characteristics of the protein. Protein truncation assays are useful in detecting 
deletions that may affect the biological activity of the protein. Various immunoassays 
designed to detect polymorphisms in MKPX. PTP4A1, PTPN7, FEM-2, DKFZP566K0524 or 
FLJ20313 may be used in screening. Where many diverse genetic mutations lead to a 
particular disease phenotype, functional protein assays have proven to be effective 
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screening tools. The activity of the encoded protein in phosphatase assays, efc.. may be 
determined by comparison with the wild-type protein. 

A sample is taken from a patient with cancer. Samples, as used herein, include 
biological fluids such as blood; organ or tissue culture derived fluids; efc. Biopsy samples or 
other sources of carcinoma cells are of particular interest, e.g. tumor biopsy, efc. Also 
included in the term are derivatives and fractions of such- ceils and fluids. The number of 
cells in a sample will generally be at least about 10 3 , usually at least 10 4 , and may be about 
10 5 or more. The cells may be dissociated, in the case of solid tissues, or tissue sections 
may be analyzed. Alternatively a lysate of the cells may be prepared. 

Detection may utilize staining of cells or histological sections, performed in 
accordance with conventional methods. The antibodies or other specific binding members of 
interest are added to the cell sample, and incubated for a period of time sufficient to allow 
binding to the epitope, usually at least about 10 minutes. The antibody may be labeled with 
radioisotopes, enzymes, fluorescers, chemiluminescers, or other labels for direct detection. 
Alternatively, a second stage antibody or reagent is used to amplify the signal. Such 
reagents are well known in the art. For example, the primary antibody may be conjugated to 
biotin, with horseradish peroxidase-conjugated avidin added as a second stage reagent. 
Final detection uses a substrate that undergoes a color change in the presence of the 
peroxidase. The absence or presence of antibody binding may be determined by various 
methods, including flow cytometry of dissociated cells, microscopy, radiography, scintillation 
counting, efc. 

An alternative method for diagnosis depends on the in vitro detection of binding 
between antibodies and the MKPX, PTP4A1, PTPN7, FEM-2. DKFZP566K0524 or 
FLJ20313 in a lysate. Measuring the concentration of the target protein in a sample or 
fraction thereof may be accomplished by a variety of specific assays. A conventional 
sandwich type assay may be used. For example, a sandwich assay may first attach specific 
antibodies to an insoluble surface or support. The particular manner of binding is not crucial 
so long as it is compatible with the reagents and overall methods of the invention. They may 
be bound to the plates covaiently or non-covalently, preferably non-covalently. 

The insoluble supports may be any compositions to which polypeptides can be 
bound, which is readily separated from soluble material, and which is otherwise compatible 
with the overall method. The surface of such supports may be solid or porous and of any 
convenient shape. Examples of suitable insoluble supports to which the receptor is bound 
include beads, e.g. magnetic beads, membranes and microtiter plates. These are typically 
made of glass, plastic (e.g. polystyrene), polysaccharides, nylon or nitrocellulose. Microtiter 
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plates are especially convenient because a large number of assays can be carried out 
simultaneously, using small amounts of reagents and samples. 

Patient sample lysates are then added to separately assayable supports (for 
example, separate wells of a microtiter plate) containing antibodies. Preferably, a series of 
standards, containing known concentrations of the test protein is assayed in parallel with the 
samples or aliquots thereof to seive as-eontrols. Preferably, each sample and standard will 
be added to multiple wells so that mean values can be obtained for each. The incubation 
time should be sufficient for binding, generally, from about 0.1 to 3 hr is sufficient. After 
incubation, the insoluble support is generally washed of non-bound components. Generally, 
a dilute non-ionic detergent medium at an appropriate pH, generally 7-8, is used as a wash 
medium. From one to six washes may be employed, with sufficient volume to thoroughly 
wash non-specifically bound proteins present in the sample. 

After washing, a solution containing a second antibody is applied. The antibody will 
bind to a polypeptide of the invention with sufficient specificity such that it can be 
distinguished from other components present. The second antibodies may be labeled to 
facilitate direct, or indirect quantification of binding. Examples of labels that permit direct 
measurement of second receptor binding include radiolabels, such as 3 H or 125 l, fluorescers, 
dyes, beads, chemilumninescers, colloidal particles, and the like. Examples of labels that 
permit indirect measurement of binding include enzymes where the substrate may provide 
for a colored or fluorescent product. In a preferred embodiment, the antibodies are labeled 
with a covalently bound enzyme capable of providing a detectable product signal after 
addition of suitable substrate. Examples of suitable enzymes for use in conjugates include 
horseradish peroxidase, alkaline phosphatase, malate dehydrogenase and the like. Where 
not commercially available, such antibody-enzyme conjugates are readily produced by 
techniques known to those skilled in the art. The incubation time should be sufficient for the 
labeled ligand to bind available molecules. Generally, from about 0.1 to 3 hr is sufficient, 
usually 1 hr sufficing. 

After the second binding step, the insoluble support is again washed free of non- 
specifically bound material, leaving the specific complex formed between the target protein 
and the specific binding member. The signal produced by the bound conjugate is detected 
by conventional means. Where an enzyme conjugate is used, an appropriate enzyme 
substrate is provided so a detectable product is formed. 

Other immunoassays are known in the art and may find use as diagnostics. 
Ouchterlony plates provide a simple determination of antibody binding. Western blots may 
be performed on protein gels or protein spots on filters, using a detection system specific for 
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one of the proteins of the invention as desired, conveniently using a labeling method as 

described for the sandwich assay. 

In some cases, a competitive assay will be used. In addition to the patient sample, a 
competitor to the targeted protein is added to the reaction mix. The competitor and the 
selected phosphatase compete for binding to the specific binding partner. Usually, the 
competitor molecule-will be labeled and detected as previously described, where the amount 
of competitor binding will be proportional to the amount of target protein present The 
concentration of competitor molecule will be from about 10 times the maximum anticipated 
protein concentration to about equal concentration in order to make the most sensitive and 

linear range of detection. 

In some embodiments, the methods are adapted for use in vivo, e.g., to locate or 
identify sites where cancer cells are present. In these embodiments, a detectably-labeled 
moiety, e.g., an antibody, which is specific for MKPX, PTP4A1, PTPN7, FEM-2, 
DKFZP566K0524 or FLJ20313 is administered to an individual (e.g., by injection), and 
labeled cells are located using standard imaging techniques, including, but not limited to, 
magnetic resonance imaging, computed tomography scanning, and the like. In this manner, 
cancer cells are differentially labeled. 

The detection methods can be provided as part of a kit. Thus, the invention further 
provides kits for detecting the presence of a MKPX, PTP4A1, PTPN7, FEM-2, 
DKFZP566K0524 or FLJ20313 mRNA, and/or a polypeptide encoded thereby, in a biological 
sample. Procedures using these kits can be performed by clinical laboratories, experimental 
laboratories, medical practitioners, or private individuals. The kits of the invention for 
detecting a polypeptide comprise a moiety that specifically binds the polypeptide, which may 
be a specific antibody. The kits of the invention for detecting a nucleic acid comprise a 
moiety that specifically hybridizes to such a nucleic acid. The kit may optionally provide 
additional components that are useful in the procedure, including, but not limited to. buffers, 
developing reagents, labels, reacting surfaces, means for detection, control samples, 
standards, instructions, and interpretive information. 

SAMPLES FOR ANALYSIS 
Sample of interest include tumor tissue, e.g. excisions, biopsies, blood samples 
where the tumoris metastatic, efc. Of particular interest are solid tumors, e.g. carcinomas, 
and include, without limitation, tumors of the liver and colon. Liver cancers of interest include 
hepatocellular carcinoma (primary liver cancer). Also called hepatoma, this is the most 
common form of primary liver cancer. Chronic infection with hepatitis B and C increases the 
risk of developing this type of cancer. Other causes include cancer-causing substances. 
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alcoholism, and chronic liver cirrhosis. Other liver cancers of interest for analysis by the 
subject methods include hepatocellular adenoma, which are benign tumors occurring most 
often in women of childbearing age; hemangioma, which are a type of benign tumor 
comprising a mass of abnormal blood vessels, cholangiocarcinoma, which originates in the 
lining of the bile channels in the liver or in the bile ducts; hepatoblastoma, which is common 
-in infants and children; angiosarcoma, which is a rare cancer that originates in_the-biood 
vessels of the liver; and bile duct carcinoma and liver cysts. Cancers originating in the lung, 
breast, colon, pancreas and stomach and blood cells commonly are found in the liver after 
they become metastatic. 

Also of interest are colon cancers. Types of polyps of the colon and rectum include 
polyps, which are any mass of tissue that arises from the bowel wall and protrudes into the 
lumen. Polyps may be sessile or pedunculated and vary considerably in size. Such lesions 
are classified histologically as tubular adenomas, tubulovillous adenomas (villoglandular 
polyps), villous (papillary) adenomas (with or without adenocarcinoma), hyperplastic polyps, 
hamartomas, juvenile polyps, polypoid carcinomas, pseudopolyps, lipomas, leiomyomas, or 
other rarer tumors. 

SCREENING METHODS 

Target Screening. Reagents specific for MKPX, PTP4A1, PTPN7, FEM-2, 
DKFZP566K0524 or FLJ20313 are used to identify targets of the encoded protein in tumor 
cells. For example, one of the nucleic acid coding sequences may be introduced into a 
tumor cell using an inducible expression system. Suitable positive and negative controls are 
included. Transient transfection assays, e.g. using adenovirus vectors, may be performed. 
The cell system allows a comparison of the pattern of gene expression in transformed cells 
with or without expression of the phosphatase. Alternatively, phosphorylation patterns after 
induction of expression are examined. Gene expression of putative target genes may be 
monitored by Northern blot or by probing microarrays of candidate genes with the test 
sample and a negative control where gene expression of the phosphatase is not induced. 
Patterns of phosphorylation may be monitored by incubation of the cells or lysate with 
labeled phosphate, followed by 1 or 2 dimensional protein gel analysis, and identification of 
the targets by MALDI, micro-sequencing, Western blot analysis, efc, as known in the art. 

Some of the potential target genes of the MKPX, PTP4A1, PTPN7, FEM-2, 
DKF2P566K0524 or FLJ20313 phosphatases identified by this method will be secondary or 
tertiary in a complex cascade of gene expression or signaling. To identify primary targets of 
the subject phosphatase activation, expression or phosphorylation will be examined early 
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after induction of expression (within 1-2 hours) or after blocking later steps in the cascade 
with cycloheximide. 

Target genes or proteins identified by this method may be analyzed for expression in 
primary patient samples as well. The data for the MKPX, PTP4A1 , PTPN7, FEM-2, 
5 DKFZP566K0524 or FLJ20313 and target gene expression may be analyzed using 
statistical analysis to establish a correlation. — ~" - 

Compound Screening. The availability of a number of components in signaling 
pathways allows in vitro reconstruction of the pathway, and/or assessent of phosphatase 

10 action on targets. Two or more of the components may be combined in vitro, and the 
behavior assessed in terms of activation of transcription of specific target sequences; 
modification of protein components, e.g. proteolytic processing, phosphorylation, 
methylation, etc.; ability of different protein components to bind to each other efc. The 
components may be modified by sequence deletion, substitution, etc. to determine the 

15 functional role of specific domains. 

Compound screening may be performed using an in vitro model, a genetically altered 
cell or animal, or purified MKPX. PTP4A1 , PTPN7, FEM-2, DKFZP566K0524 or FLJ20313 
protein. One can identify ligands or substrates that bind to, modulate or mimic the action of 
the encoded polypeptide. Areas of investigation include the development of treatments for 

20 hyper-proliferative disorders, e.g. cancer, restenosis, osteoarthritis, metastasis, etc. 

The polypeptides include those encoded by SEQ ID NOS:1, 3, 5, 7, 9 or 11, as well 
as nucleic acids that, by virtue of the degeneracy of the genetic code, are not identical in 
sequence to the disclosed nucleic acids, and variants thereof. Variant polypeptides can 
include amino acid (aa) substitutions, additions or deletions. The amino acid substitutions 

25 can be conservative amino acid substitutions or substitutions to eliminate non-essential 
amino acids, such as to alter a glycosylation site, a phosphorylation site or an acetylation 
site, or to minimize misfolding by substitution or deletion of one or more cysteine residues 
that are not necessary for function. Variants can be designed so as to retain or have 
enhanced biological activity of a particular region of the protein (e.g., a functional domain 

30 and/or, where the polypeptide is a member of a protein family, a region associated with a 
consensus sequence). Variants also include fragments of the polypeptides disclosed herein, 
particularly biologically active fragments and/or fragments corresponding to functional 
domains. Fragments of interest will typically be at least about 10 aa to at least about 15 aa 
in length, usually at least about 50 aa in length, and can be as long as 300 aa in length or 

35 longer, but will usually not exceed about 500 aa in length, where the fragment will have a 
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contiguous stretch of amino acids that is identical to a polypeptide encoded by SEQ ID 
NOS:2, 4, 6, 8, 10 or 12, or a homolog thereof. 

Transgenic animals or cells derived therefrom are also used in compound screening. 
Transgenic animals may be made through homologous recombination, where the normal 
locus corresponding to SEQ ID NOS:1, 3, 5, 7, 9 or 11 is altered. Alternatively, a nucleic 
acid construct is randomly integrated into the genome. Vectors for stable integration include 
plasmids, retroviruses and other animal viruses, YACs, and the like. A series of small 
deletions and/or substitutions may be made in the coding sequence to determine the role of 
different exons in phosphatase activity, oncogenesis, signal transduction, efc. Of interest is 
the use of SEQ ID NOS:1, 3, 5, 7, 9 or 11 to construct transgenic animal models for cancer, 
where expression of the corresponding phosphatase is specifically reduced or absent. 
Specific constructs of interest include antisense sequences that block expression of the 
targeted gene and expression of dominant negative mutations. A detectable marker, such as 
lac Z may be introduced into the locus of interest, where up-regulation of expression will 
result in an easily detected change in phenotype. One may also provide for expression of 
the target gene or variants thereof in cells or tissues where it is not normally expressed or at 
abnormal times of development. By providing expression of the target protein in cells in 
which it is not normally produced, one can induce changes in cell behavior, e.g. in the control 
of cell growth and tumorigenesis. 

Compound screening identifies agents that modulate function of MKPX, PTP4A1, 
PTPN7, FEM-2, DKFZP566K0524 or FLJ20313 . Agents that mimic its function are 
predicted to activate the process of cell division and growth. Conversely, agents that inhibit 
function may inhibit transformation. Of particular interest are screening assays for agents 
that have a low toxicity for human cells. A wide variety of assays may be used for this 
purpose, including labeled in vitro protein-protein binding assays, electrophoretic mobility 
shift assays, immunoassays for protein binding, and the like. Knowledge of the 3- 
dimensional structure of the encoded protein, derived from crystallization of purified 
recombinant protein, could lead to the rational design of small drugs that specifically inhibit 
activity. These drugs may be directed at specific domains, e.g. the phosphatase catalytic 
domain, the regulatory domain, the auto-inhibitory domain, etc. 

The term "agent" as used herein describes any molecule, e.g. protein or 
pharmaceutical, with the capability of altering or mimicking the physiological function of 
MKPX, PTP4A1, PTPN7, FEM-2, DKFZP566K0524 or FLJ20313. Generally a plurality of 
assay mixtures are run in parallel with different agent concentrations to obtain a differential 
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response to the various concentrations. Typically one of these concentrations serves as a 
negative control, i.e. at zero concentration or below the level of detection. 

Candidate agents encompass numerous chemical classes, though typically they are 
organic molecules, preferably small organic compounds having a molecular weight of more 
5 than 50 and less than about 2,500 daltons. Candidate agents comprise functional groups 
necessary for structural interaction with proteins, particularly hydrogen bonding, and typically, 
include at least an amine, carbonyl, hydroxyl or carboxyl group, preferably at least two of the 
functional chemical groups. The candidate agents often comprise cyclical carbon or 
heterocyclic structures and/or aromatic or polyaromatic structures substituted with one or 
10 more of the above functional groups. Candidate agents are also found among biomolecules 
including peptides, saccharides, fatly acids, steroids, purines, pyrimidines, derivatives, 
structural analogs or combinations thereof. 

Candidate agents are obtained from a wide variety of sources including libraries of 
synthetic or natural compounds. For example, numerous means are available for random 
15 and directed synthesis of a wide variety of organic compounds and biomolecules, including 
expression of randomized oligonucleotides and oligopeptides. Alternatively, libraries of 
natural compounds in the form of bacterial, fungal, plant and animal extracts are available or 
readily produced. Additionally, natural or synthetically produced libraries and compounds are 
readily modified through conventional chemical, physical and biochemical means, and may 
20 be used to produce combinatorial libraries. Known pharmacological agents may be 
subjected to directed or random chemical modifications, such as acylation, alkylation, 
esterification, amidification, etc. to produce structural analogs. 

Where the screening assay is a binding assay, one or more of the molecules may be 
joined to a label, where the label can directly or indirectly provide a detectable signal. 
25 Various labels include radioisotopes, fluorescers, chemiluminescers, enzymes, specific 
binding molecules, particles, e.g. magnetic particles, and the like. Specific binding molecules 
include pairs, such as biotin and streptavidin, digoxin and antidigoxin. etc. For the specific 
binding members, the complementary member would normally be labeled with a molecule 
that provides for detection, in accordance with known procedures. 
30 A variety of other reagents may be included in the screening assay. These include 

reagents like salts, neutral proteins, e.g. albumin, detergents, efc. that are used to facilitate 
optimal protein-protein binding and/or reduce non-specific or background interactions. 
Reagents that improve the efficiency of the assay, such as protease inhibitors, nuclease 
inhibitors, anti-microbial agents, etc. may be used. The mixture of components are added in 
35 any order that provides for the requisite binding. Incubations are performed at any suitable 
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temperature, typically between 4 and 40° C. Incubation periods are selected for optimum 
activity, but may also be optimized to facilitate rapid high-throughput screening. Typically 
between 0.1 and 1 hours will be sufficient 

Other assays of interest detect agents that mimic the function of MKPX, PTP4A1, 
5 PTPN7, FEM-2, DKFZP566K0524 or FLJ20313. For example, an expression construct 
comprising the gene may be introduced into a cell line - under conditions that allow 
expression. The level of phosphatase activity is determined by a functional assay, for 
example detection of protein phosphorylation. Alternatively, candidate agents are added to a 
cell that lacks MKPX, PTP4A1, PTPN7, FEM-2, DKFZP566K0524 or FLJ20313, and 

10 screened for the ability to reproduce the activity in a functional assay. 

The compounds having the desired pharmacological activity may be administered in 
a physiologically acceptable carrier to a host for treatment of cancer, eto. The compounds 
may also be used to enhance function in wound healing, cell growth, eto. The inhibitory 
agents may be administered in a variety of ways, orally, topically, parenterally e.g. 

15 subcutaneously, intraperitoneally, by viral infection, intravascularly, eto. Depending upon the 
manner of introduction, the compounds may be formulated in a variety of ways. The 
concentration of therapeutically active compound in the formulation may vary from about 0.1- 
10wt%. 

20 Formulations. The compounds of this invention can be incorporated into a variety of 

formulations for therapeutic administration. Particularly, agents that modulate MKPX, 
PTP4A1, PTPN7, FEM-2, DKFZP566K0524 or FLJ20313 activity are formulated for 
administration to patients for the treatment of cells where the target activity is undesirably 
high or low, e.g. to reduce the level of activity in cancer cells. More particularly, the 

25 compounds of the present invention can be formulated into pharmaceutical compositions by 
combination with appropriate, pharmaceutical^ acceptable carriers or diluents, and may be 
formulated into preparations in solid, semi-solid, liquid or gaseous forms, such as tablets, 
capsules, powders, granules, ointments, solutions, suppositories, injections, inhalants, gels, 
microspheres, and aerosols. As such, administration of the compounds can be achieved in 

30 various ways, including oral, buccal, rectal, parenteral, intraperitoneal, intradermal, 
transdermal, intra-tracheal, eto., administration. The agent may be systemic after 
administration or may be localized by the use of an implant that acts to retain the active dose 
at the site of implantation. 

In pharmaceutical dosage forms, the compounds may be administered in the form of 

35 their pharmaceutical^ acceptable salts, or they may also be used alone or in appropriate 
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association, as well as in combination with other pharmaceutically active compounds. The 
following methods and excipients are merely exemplary and are In no way limiting. 

For oral preparations; the compounds can be used alone or in combination with 
appropriate additives to make tablets, powders, granules or capsules, for example, with 

5 conventional additives, such as lactose, mannitol, com starch or potato starch; with binders, 
such- as crystalline cellulose, cellulose derivatives, ac3eia, com starch or gelatins; with — 
disintegrators, such as corn starch, potato starch or sodium carboxymethylcellulose; with 
lubricants, such as talc or magnesium stearate; and if desired, with diluents, buffering 
agents, moistening agents, preservatives and flavoring agents. 

10 The compounds can be formulated into preparations for injections by dissolving, 

suspending or emulsifying them in an aqueous or nonaqueous solvent, such as vegetable or 
other similar oils, synthetic aliphatic acid glycerides, esters of higher aliphatic acids or 
propylene glycol; and if desired, with conventional additives such as solubilizers, isotonic 
agents, suspending agents, emulsifying agents, stabilizers and preservatives. 

15 The compounds can be utilized in aerosol formulation to be administered via 

inhalation. The compounds of the present invention can be formulated into pressurized 
acceptable propellants such as dichlorodifluoromethane, propane, nitrogen and the like. 

Furthermore, the compounds can be made into suppositories by mixing with a variety 
of bases such as emulsifying bases or water-soluble bases. The compounds of the present 

20 invention can be administered rectally via a suppository. The suppository can include 
vehicles such as cocoa butter, carbowaxes and polyethylene glycols, which melt at body 
temperature, yet are solidified at room temperature. 

Unit dosage forms for oral or rectal administration such as syrups, elixirs, and 
suspensions may be provided wherein each dosage unit, for example, teaspoonful, 

25 tablespoonful, tablet or suppository, contains a predetermined amount of the composition 
containing one or more compounds of the present invention. Similarly, unit dosage forms for 
injection or intravenous administration may comprise the compound of the present invention 
in a composition as a solution in sterile water, normal saline or another pharmaceutically 
acceptable carrier. 

30 Implants for sustained release formulations are well-known in the art. Implants are 

formulated as microspheres, slabs, etc. with biodegradable or non-biodegradable polymers. 
For example, polymers of lactic acid and/or glycolic acid form an erodible polymer that is 
well-tolerated by the host. The implant is placed in proximity to the site of disease, so that the 
local concentration of active agent Is increased relative to the rest of the body. 
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The term "unit dosage form," as used herein, refers to physically discrete units 
suitable as unitary dosages for human and animal subjects, each unit containing a 
predetermined quantity of compounds of the present invention calculated in an amount 
sufficient to produce the desired effect in association with a pharmaceutlcally acceptable 
diluent, carrier or vehicle. The specifications for the novel unit dosage forms of the present 
invention depend on the particulaPbompound employed and the effect to be achieved, and 
the pharmacodynamics associated with each compound in the host. 

The pharmaceutically acceptable excipients, such as vehicles, adjuvants, carriers or 
diluents, are readily available to the public. Moreover, pharmaceutically acceptable auxiliary 
substances, such as pH adjusting and buffering agents, tonicity adjusting agents, stabilizers, 
wetting agents and the like, are readily available to the public. 

Typical dosages for systemic administration range from 0.1 ug to 100 milligrams per 
kg weight of subject per administration. A typical dosage may be one tablet taken from two 
to six times daily, or one time-release capsule or tablet taken once a day and containing a 
proportionally higher content of active ingredient. The time-release effect may be obtained 
by capsule materials that dissolve at different pH values, by capsules that release slowly by 
osmotic pressure, or by any other known means of controlled release. 

Those of skill will readily appreciate that dose levels can vary as a function of the 
specific compound, the severity of the symptoms and the susceptibility of the subject to side 
effects. Some of the specific compounds are more potent than others. Preferred dosages for 
a given compound are readily determinable by those of skill in the art by a variety of means. 
A preferred means is to measure the physiological potency of a given compound. 

The use of liposomes as a delivery vehicle is one method of interest. The liposomes 
fuse with the cells of the target site and deliver the contents of the lumen intracellularly. The 
liposomes are maintained in contact with the cells for sufficient time for fusion, using various 
means to maintain contact, such as isolation, binding agents, and the like. In one aspect of 
the invention, liposomes are designed to be aerosolized for pulmonary administration. 
Liposomes may be prepared with purified proteins or peptides that mediate fusion of 
membranes, such as Sendai virus or influenza virus, etc. The lipids may be any useful 
combination of known liposome forming lipids, including cationic lipids, such as 
phosphatidylcholine. The remaining lipid will normally be neutral lipids, such as cholesterol, 
phosphatidyl serine, phosphatidyl glycerol, and the like. 

MODULATION OF ENZYME ACTIVITY 
Agents that block activity of MKPX, PTP4A1, PTPN7, FEM-2, DKFZP566K0524 or 
FLJ20313 provide a point of intervention in an important signaling pathway. Numerous 

30 



WO 03/083102 PCT/CA03/00393 

agents are useful in reducing this activity, including agents that directly modulate expression 
as described above, e.g. expression vectors, antisense specific for the targeted 
phosphatase; and agents that act on the protein, e.g. specific antibodies and analogs 
thereof, small organic molecules that block catalytic activity, efc. 
5 The genes, gene fragments, or the encoded protein or protein fragments are useful in 

therapy to~freaV "disorders associated with defects in sequence or expression. From a 
therapeutic point of view, inhibiting activity has a therapeutic effect on a number of 
proliferative disorders, including inflammation, restenosis, and cancer. Inhibition is achieved 
in a number of ways. Antisense sequences may be administered to inhibit expression. 
10 Pseudo-substrate inhibitors, for example, a peptide that mimics a substrate for the 
phosphatase may be used to inhibit activity. Other inhibitors are identified by screening for 
biological activity in a functional assay, e.g. in vitro or in vivo phosphatase activity. 

Expression vectors may be used to introduce the target gene into a cell. Such 
vectors generally have convenient restriction sites located near the promoter sequence to 
15 provide for the insertion of nucleic acid sequences. Transcription cassettes may be prepared 
comprising a transcription initiation region, the target gene or fragment thereof, and a 
transcriptional termination region. The transcription cassettes may be introduced into a 
variety of vectors, e.g. plasmid; retrovirus, e.g. lentivirus; adenovirus; and the like, where the 
vectors are able to transiently or stably be maintained in the cells, usually for a period of at 
20 least about one day, more usually for a period of at least about several days to several 
weeks. 

The gene or protein may be introduced into tissues or host cells by any number of 
routes, including viral infection, microinjection, or fusion of vesicles. Jet injection may also be 
used for intramuscular administration, as described by Furth et a/., Anal Biochem (1992) 

25 205:365-368. The DNA may be coated onto gold microparticles, and delivered intradermally 
by a particle bombardment device, or "gene gun" as described in the literature (see, for 
example, Tang et a/., Nature (1992) 356:152-154), where gold micro-projectiles are coated 
with the protein or DNA, then bombarded into skin cells. 

Antisense molecules can be used to down-regulate expression in cells. The 

30 antisense reagent may be antisense oligonucleotides (ODN), particularly synthetic ODN 
having chemical modifications from native nucleic acids, or nucleic acid constructs tha 
express such antisense molecules as RNA. The antisense sequence is complementary tc 
the mRNA of the targeted gene, and inhibits expression of the targeted gene products 
Antisense molecules inhibit gene expression through various mechanisms, e.g. by reducinc 

35 the amount of mRNA available for translation, through activation of RNAse H, or steri< 
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hindrance. One or a combination of antisense molecules may be administered, where a 
combination may comprise multiple different sequences. 

Antisense molecules may be produced by expression of all or a part of the target 
gene sequence in an appropriate vector, where the transcriptional initiation is oriented such 
that an antisense strand is produced as an RNA molecule. Alternatively, the antisense 
molecule is a synthetic oligonucleotide. Antisense oligonucleotides- will generally be at least 
about 7, usually at least about 12, more usually at least about 20 nucleotides in length, and 
not more than about 500, usually not more than about 50, more usually not more than about 
35 nucleotides in length, where the length is governed by efficiency of inhibition, specificity, 
including absence of cross-reactivity, and the like. It has been found that short 
oligonucleotides, of from 7 to 8 bases in length, can be strong and selective inhibitors of 
gene expression (see Wagner et a/., Nature Biotechnology (1996) 14:840-844). 

A specific region or regions of the endogenous sense strand mRNA sequence is 
chosen to be complemented by the antisense sequence. Selection of a specific sequence 
for the oligonucleotide may use an empirical method, where several candidate sequences 
are assayed for inhibition of expression of the target gene in vitro or in an animal model. A 
combination of sequences may also be used, where several regions of the mRNA sequence 
are selected for antisense complementation. 

Antisense oligonucleotides may be chemically synthesized by methods known in the 
art (see Wagner ef a/. (1993) supra, and Milligan et a/., supra.) Preferred oligonucleotides 
are chemically modified from the native phosphodiester structure, in order to increase their 
intracellular stability and binding affinity. A number of such modifications have been 
described in the literature, which alter the chemistry of the backbone, sugars or heterocyclic 
bases. 

Among useful changes in the backbone chemistry are phosphorothioates; 
phosphorodithioates, where both of the non-bridging oxygens are substituted with sulfur; 
phosphoroamidites; alkyl phosphotriesters and boranophosphates. Achiral phosphate 
derivatives include S'-O'-S'-S-phosphorothioate, S'-S-S'-O-phosphorothioate, 3'-CH2-5'-0- 
phosphonate and 3'-NH-5'-0-phosphoroamidate. Peptide nucleic acids replace the entire 
ribose phosphodiester backbone with a peptide linkage. Sugar modifications are also used to 
enhance stability and affinity. The a-anomer of deoxyribose may be used, where the base is 
inverted with respect to the natural p-anomer. The Z-OH of the ribose sugar may be altered 
to form Z-O-methyl or 2-O-allyl sugars, which provides resistance to degradation without 
comprising affinity. Modification of the heterocyclic bases must maintain proper base pairing. 
Some useful substitutions include deoxyuridine for deoxythymidine; 5-methyl-2*- 
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deoxycytidine and 5-bromo-2'-deoxycytidine for deoxycytidine. S-propynyW-deoxyuridine 
and 5-propynyl-2'-deoxycytidine have been shown to increase affinity and biological activity 
when substituted for deoxythymidine and deoxycytidine, respectively. 

5 THERAPEUTIC AND IMAGING ANTIBODIES 

Anti-phosphatase antibodies find_fpr use therapeutic and imaging purposes. Such 
antibodies, which may be selected as described above, may be utilized without further 
modification to include a cytotoxic or imaging moiety, or may be modified by conjugation to 
include such cytotoxic or imaging agents. 

10 As used herein, "cytotoxic moiety" (C) simply means a moiety that inhibits cell growth 

or promotes cell death when proximate to or absorbed by the cell. Suitable cytotoxic 
moieties in this regard include radioactive isotopes (radionuclides), chemotoxic agents such 
as differentiation inducers and small chemotoxic drugs, toxin proteins, and derivatives 
thereof. As utilized herein, "imaging moiety" (I) means a moiety which can be utilized to 

15 increase contrast between a tumor and the surrounding healthy tissue in a visualization 
technique (e.g., radiography, positron-emission tomography, magnetic resonance imaging, 
direct or indirect visual inspection.) Thus, suitable imaging moieties include radiography 
moieties (e.g. heavy metals and radiation emitting moieties), positron emitting moieties, 
magnetic resonance contrast moieties, and optically visible moieties (e.g., fluorescent or 

20 visible-spectrum dyes, visible particles, etc.). It will be appreciated by one of ordinary skill 
that some overlap exists between what is a therapeutic moiety and what is an imaging 
moiety. For instance 212 Pb and 212 Bi are both useful radioisotopes for therapeutic 
compositions, but are also electron-dense, and thus provide contrast for X-ray radiographic 
imaging techniques, and can also be utilized in scintillation imaging techniques. 

25 In general, therapeutic or imaging agents may be conjugated to the anti-phosphatase 

moiety by any suitable technique, with appropriate consideration of the need for 
pharmokinetic stability and reduced overall toxicity to the patient. A therapeutic agent may 
be coupled to a suitable antibody moiety either directly or indirectly (e.g. via a linker group). 
A direct reaction between an agent and an antibody is possible when each possesses a 

30 functional group capable of reacting with the other. For example, a nucleophilic group, such 
as an amino or sulfhydryl group, may be capable of reacting with a carbonyl-containing 
group, such as an anhydride or an acid halide, or with an alkyl group containing a good 
leaving group (e.g., a halide). Alternatively, a suitable chemical linker group may be used. A 
linker group can function as a spacer to distance an antibody from an agent in order to avoid 

35 interference with binding capabilities. A linker group can also serve to increase the chemical 
reactivity of a substituent on a moiety or an antibody, and thus increase the coupling 
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efficiency. An increase in chemical reactivity may also facilitate the use of moieties, or 
functional groups on moieties, which otherwise would not be possible. 

Suitable linkage chemistries include maleimidyl linkers and alkyl halide linkers (which 
react with a sulfhydryl on the antibody moiety) and succinimidyl linkers (which react with a 

5 primary amine on the antibody moiety). Several primary amine and sulfhydryl groups are 
present on immunoglobulins, and additional groups may be designed into recombinant 
immunoglobulin molecules. It will be evident to those skilled in the art that a variety of 
bifunctional or polyfunctional reagents, both homo- and hetero-functional (such as those 
described in the catalog of the Pierce Chemical Co., Rockford, III.), may be employed as a 

10 linker group. Coupling may be effected, for example, through amino groups, carboxyl 
groups, sulfhydryl groups or oxidized carbohydrate residues. There are numerous 
references describing such methodology, e.g., U.S. Patent No. 4,671 ,958. As an alternative 
coupling method, cytotoxic or imaging moieties may be coupled to the antibody moiety 
through an oxidized carbohydrate group at a glycosylation site, as described in U.S. Patents 

15 No. 5,057,313 and 5,156,840. Yet another alternative method of coupling the antibody 
moiety to the cytotoxic or imaging moiety is by the use of a non-covalent binding pair, such 
as streptavidin/biotin, or avidin/biotin. In these embodiments, one member of the pair 
covalently coupled to the antibody moiety and the other member of the binding pair is 
covalently coupled to the cytotoxic or imaging moiety. 

20 Where a cytotoxic moiety is more potent when free from the antibody portion of the 

immunoconjugates of the present invention, it may be desirable to use a linker group that is 
cleavable during or upon internalization into a cell, or that is gradually cleavable over time In 
the extracellular environment. A number of different cleavable linker groups have been 
described. The mechanisms for the intracellular release of a cytotoxic moiety agent from 

25 these linker groups include cleavage by reduction of a disulfide bond (e.g., U.S. Patent No. 
4,489,710), by irradiation of a photolabile bond (e.g., U.S. Patent No. 4,625,014), by 
hydrolysis of derivatized amino acid side chains (e.g., U.S. Patent No. 4,638,045), by serum 
complement-mediated hydrolysis (e.g., U.S. Patent No. 4,671,958), and acid-catalyzed 
hydrolysis (e.g., U.S. Patent No. 4,569,789). 

30 It may be desirable to couple more than one cytotoxic and/or imaging moiety to an 

antibody. By poly-derivatizing the antibody, several cytotoxic strategies may be 
simultaneously implemented, an antibody may be made useful as a contrasting agent for 
several visualization techniques, or a therapeutic antibody may be labeled for tracking by a 
visualization technique. In one embodiment, multiple molecules of an imaging or cytotoxic 

35 moiety are coupled to one antibody molecule. In another embodiment, more than one type 
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of moiety may be coupled to one antibody. Regardless of the particular embodiment, 
immunoconjugates with more than one moiety may be prepared in a variety of ways. For 
example, more than one moiety may be coupled directly to an antibody molecule, or linkers 
which provide multiple sites for attachment (e.g., dendrimers) can be used. Alternatively, a 
carrier with the capacity to hold more than one cytotoxic or imaging moiety can be used. 
_ A carrier may bear the agents in a variety of ways, including covalent bonding either- 
directly or via a linker group, and non-covalent associations. Suitable covalent-bond carriers 
include proteins such as albumins (e.g., U.S. Patent No. 4,507,234), peptides, and 
polysaccharides such as aminodextran (e.g., U.S. Patent No. 4,699,784), each of which have 
multiple sites for the attachment of moieties. A carrier may also bear an agent by non- 
covalent associations, such as non-covalent bonding or by encapsulation, such as within a 
liposome vesicle (e.g., U.S. Patents Nos. 4,429,008 and 4,873,088). Encapsulation carriers 
are especially useful for imaging moiety conjugation to antibody moieties for use in the 
invention, as a sufficient amount of the imaging moiety (dye, magnetic resonance contrast 
reagent, etc.) for detection may be more easily associated with the antibody moiety. In 
addition, encapsulation carriers are also useful in chemotoxic therapeutic embodiments, as 
they can allow the therapeutic compositions to gradually release a chemotoxic moiety over 
time while concentrating it in the vicinity of the tumor cells. 

Carriers and linkers specific for radionuclide agents (both for use as cytotoxic 
moieties or positron-emission imaging moieties) include radiohalogenated small molecules 
and chelating compounds. For example, U.S. Patent No. 4,735,792 discloses representative 
radiohalogenated small molecules and their synthesis. A radionuclide chelate may be 
formed from chelating compounds that include those containing nitrogen and sulfur atoms as 
the donor atoms for binding the metal, or metal oxide, radionuclide. For example, U.S. 
Patent No. 4,673,562, to Davison et al. discloses representative chelating compounds and 
their synthesis. Such chelation carriers are also useful for magnetic spin contrast ions for 
use in magnetic resonance imaging tumor visualization methods, and for the chelation of 
heavy metal ions for use in radiographic visualization methods. 

Preferred radionuclides for use as cytotoxic moieties are radionulcides which are 
suitable for pharmacological administration. Such radionuclides include 123 l, 126 l, 131 l, 90 Y, 
211 At, 67 Cu, 186 Re, 1fl8 Re, 212 Pb, and 212 Bi. Iodine and astatine isotopes are more preferred 
radionuclides for use in the therapeutic compositions of the present invention, as a large 
body of literature has been accumulated regarding their use. 131 l is particularly preferred, as 
are other p-radiation emitting nuclides, which have an effective range of several millimeters. 
123 l, 125 l, 131 l, or 211 At may be conjugated to antibody moieties for use in the compositions and 
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methods utilizing any of several known conjugation reagents, including lodogen, A/- 
succinimidyl 3-[ 211 At]astatobenzoate, AA-succinimidy! 3-[ 131 l]iodobenzoate (SIB), and , N- 
succinimidyl 5-[ 131 l]iodob-3-pyridinecarboxyIate (SIPC). Any iodine isotope may be utilized in 
the recited iodo-reagents. For example, a suitable antibody for use in the present invention 
5 may be easily made by coupling an Fab fragment of the BD Transduction Labs R20720 anti- 
SEQ ID NOS:2, 4, 678, 10 or 12 MAb with 131 l lodogen according-tc-the manufacturer's 
instructions. Other radionuclides may be conjugated to anti-SEQ ID NOS:2, 4, 6, 8, 10 or 12 
antibody moieties by suitable chelation agents known to those of skill in the nuclear medicine 
arts. 

10 Preferred chemotoxic agents include small-molecule drugs such as methotrexate, 

and pyrimidine and purine analogs. Preferred chemotoxin differentiation inducers include 
phorbol esters and butyric acid. Chemotoxic moieties may be directly conjugated to the 
antibody moiety via a chemical linker, or may encapsulated in a carrier, which is in turn 
coupled to the antibody moiety. 

15 Preferred toxin proteins for use as cytotoxic moieties include ricin, abrin, diphtheria 

toxin, cholera toxin, gelonin, Pseudomonas exotoxin, Shigella toxin, pokeweed antiviral 
protein, and other toxin proteins known in the medicinal biochemistry arts. As these toxin 
agents may elicit undesirable immune responses in the patient, especially if injected 
intravascularly, it is preferred that they be encapsulated in a carrier for coupling to the 

20 antibody moiety. 

Preferred radiographic moieties for use as imaging moieties in the present invention 
include compounds and chelates with relatively large atoms, such as gold, iridium, 
technetium, barium, thallium, iodine, and their isotopes. It is preferred that less toxic 
radiographic imaging moieties, such as iodine or iodine isotopes, be utilized in the 

25 compositions and methods of the invention. Examples of such compositions which may be 
utilized for x-ray radiography are described in U.S. Patent No. 5,709,846, incorporated fully 
herein by reference. Such moieties may be conjugated to the anti-SEQ ID NOS:2, 4, 6, 8, 10 
or 12 antibody moiety through an acceptable chemical linker or chelation carrier. Positron 
emitting moieties for use in the present invention include 18 F, which can be easily conjugated 

30 by a fluorination reaction with the antibody moiety according to the method described in U.S. 
Patent No. 6,187,284. 

Preferred magnetic resonance contrast moieties include chelates of chromium(lll), 
manganese(ll), iron(ll), nickel(ll), copper(II), praseodymium(lll), neodymium(lll), 
samarium(lll) and ytterbium(lll) ion. Because of their very strong magnetic moment, the 

35 gadolinium(lll), terbium(lll), dysprosium(lll), holmium(lll), erbium(lll), and iron(lll) ions are 
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especially preferred. Examples of such chelates, suitable for magnetic resonance spin 
imaging, are described in U.S. Patent No. 5,733,522, incorporated fully herein by reference. 
Nuclear spin contrast chelates may be conjugated to the antibody moieties through a suitable 
chemical linker. 

5 Optically visible moieties for use as imaging moieties include fluorescent dyes, or 

visible-spectrum dyes, visible particles, and other-visible labeling moieties. Fluorescent dyes- 
such as fluorescein, coumarin, rhodamine, bodipy Texas red, and cyanine dyes, are useful 
when sufficient excitation energy can be provided to the site to be inspected visually. 
Endoscopic visualization procedures may be more compatible with the use of such labels. 
10 For many procedures where imaging agents are useful, such as during an operation to 
resect a brain tumor, visible spectrum dyes are preferred. Acceptable dyes include FDA- 
approved food dyes and colors, which are non-toxic, although pharmaceutically acceptable 
dyes that have been approved for internal administration are preferred. In some 
embodiments, such dyes are encapsulated in carrier moieties, which are in turn conjugated 
15 to the antibody. Alternatively, visible particles, such as colloidal gold particles or latex 
particles, may be coupled to the antibody moiety via a suitable chemical linker. 

For administration, the antibody-therapeutic or antibody-imaging agent will generally 
be mixed, prior to administration, with a non-toxic, pharmaceutically acceptable carrier 
substance. Usually, this will be an aqueous solution, such as normal saline or phosphate- 
20 buffered saline (PBS), Ringer's solution, lactate-Ringer's solution, or any isotonic 
physiologically acceptable solution for administration by the chosen means. Preferably, the 
solution is sterile and pyrogen-free, and is manufactured and packaged under current Good 
Manufacturing Processes (GMP), as approved by the FDA or HPB. The clinician of ordinary 
skill is familiar with appropriate ranges for pH, tonicity, and additives or preservatives when 
25 formulating pharmaceutical compositions for administration by intravascular injection, 
intrathecal injection, injection into the cerebro-spinal fluid, direct injection into the tumor, or by 
other routes. In addition to additives for adjusting pH or tonicity, the antibody-therapeutics 
and antibody-imaging agents may be stabilized against aggregation and polymerization with 
amino acids and non-ionic detergents, polysorbate, and polyethylene glycol. Optionally, 
30 additional stabilizers may include various physiologically-acceptable carbohydrates and salts. 
Also, polyvinylpyrrolidone may be added in addition to the amino acid. Suitable therapeutic 
immunoglobulin solutions which are stabilized for storage and administration to humans are 
described in U.S. Patent No. 5,945,098. incorporated fully herein by reference. Other agents, 
such as human serum albumin (HSA), may be added to the therapeutic or imaging 
35 composition to stabilize the antibody conjugates. Antibodies coupled to cytotoxic moieties will 
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recognize their targets within the body, where the cytotoxic moiety is brought in contact to or 
in close proximity to the a tumor, whereupon the cytotoxic moiety interferes with the tumor 
and reduces it's growth, reduces is size, prevents metastasis, or otherwise kills the cells in 
the tumor. Antibodies coupled to Imaging moieties will recognize their targets within the body, 
5 whereupon their targets can be visualized using suitable methods described above, as is 
appropriate fcr-the imaging moiety used. 

EXAMPLES 

The following examples are put forth so as to provide those of ordinary skill in the art 
10 with a complete disclosure and description of how to make and use the present invention, 
and are not intended to limit the scope of what the inventors regard as their invention nor are 
they intended to represent that the experiments below are all or the only experiments 
performed. Efforts have been made to ensure accuracy with respect to numbers used (e.g. 
amounts, temperature, etc.) but some experimental errors and deviations should be 
15 accounted for. Unless indicated otherwise, parts are parts by weight, molecular weight is 
weight average molecular weight, temperature is in degrees Centigrade, and pressure is at 
or near atmospheric. 

All publications and patent applications cited in this specification are herein 
incorporated by reference as if each individual publication or patent application were 

20 specifically and individually indicated to be incorporated by reference. 

The present invention has been described in terms of particular embodiments found 
or proposed by the present inventor to comprise preferred modes for the practice of the 
invention. It will be appreciated by those of skill in the art that, in light of the present 
disclosure, numerous modifications and changes can be made in the particular embodiments 

25 exemplified without departing from the intended scope of the invention. For example, due to 
codon redundancy, changes can be made in the underlying DNA sequence without affecting 
the protein sequence. Moreover, due to biological functional equivalency considerations, 
changes can be made in protein structure without affecting the biological action in kind or 
amount. All such modifications are intended to be included within the scope of the appended 

30 claims. 

Example 1 
Identification of phosphatase sequences 

35 The Genbank database was searched for ESTs showing similarity to known 

phosphatase domain-related proteins using the "basic local alignment search tool" program, 
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TBLASTN, with default settings. Human ESTs identified as having similarity to these known 
phosphatase domains (defined as p < 0.0001) were used in a BLASTN and BLASTX screen 
of the Genbank non-redundant (NR) database. 

ESTs that had top human hits with >95% identity over 100 amino acids were 
discarded. The remaining BLASTN and BLASTX outputs for each EST were examined 
manually, i.e., ESTs were removed from the analysis if the inventors determined that the 
variation from the known phosphatase domain-related probe sequence was a result of poor 
database sequence. Poor database sequence was usually identified as a number of 'N' 
nucleotides in the database sequence for a BLASTN search and as a base deletion or 
insertion in the database sequence, resulting in a peptide frameshift, for a BLASTX output. 
ESTs for which the highest scoring match was to non-phosphatase domain-related 
sequences were also discarded at this stage. 

Using widely known algorithms, e.g. "Smith/Waterman", "FastA", "FastP", 
"Needleman/Wunsch", "Blast", "PSIBIast," homology of the subject nucleic acid to other 
known nucleic acids was determined. A "Local FastP Search" algorithm was performed in 
order to determine the homology of the subject nucleic acid invention to known sequences. 
Then, a ktup value, typically ranging from 1 to 3 and a segment length value, typically 
ranging from 20 to 200, were selected as parameters. Next, an array of position for the 
probe sequence was constructed in which the cells of the array contain a list of positions of 
that substring of length ktup. For each subsequence in the position array, the target 
sequence was matched and augmented the score array cell corresponding to the diagonal 
defined by the target position and the probe subsequence position. A list was then 
generated and sorted by score and report. The criterion for perfect matches and for 
mismatches was based on the statistics properties of that algorithm and that database, 
typically the values were: 98% or more match over 200 nucleotides would constitute a match; 
and any mismatch in 20 nucleotides would constitute a mismatch. 

Analysis of the BLASTN and BLASTX outputs identified an EST sequence from an 
IMAGE clone that had potential for being associated with a sequence encoding a 
phosphatase domain-related protein, e.g., the sequence had homology, but not identity, to 
known phosphatase domain-related proteins. 

After identification of phosphatase ESTs, the clones were added to Kinetek's clone 
bank for analysis of gene expression in tumor samples. Gene expression work involved 
construction of unigene clusters, which are represented by entries in the "pks" database. A 
list of accession numbers for members of the clusters were assigned. Subtraction of the 
clusters already present in the clone bank from the clusters recently added left a list of 
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clusters that had not been previously represented in Kinetek's clone bank. For each of the 
clusters, a random selection of an EST IMAGE accession numbers were chosen to represent 
the clusters. For each of the clusters which did not have an EST IMAGE clone, generation of 
a report so that clone ordering or construction could be implemented was performed on a 
5 case by case basis. A list of accession numbers which were not in clusters was constructed 
and a report was generated. — - 

The identified IMAGE clones were sequenced using standard ABI dye-primer and 
dye-terminator chemistry on a 377 automatic DNA sequencer. 

10 Example 2 

Fvnrpssinn Analysis of MKPX. PTP4A1 . PTPN7. FEM-2. DKFZP566K0524 and FLJ20313 

The expression of MKPX. PTP4A1 , PTPN7, FEM-2. DKFZP566K0524 and FLJ20313 
was determined by dot blot analysis, and the proteins were found to be upregulated in 

15 several tumor samples. 

Dot blot preparation. Total RNA was purified from clinical cancer and control samples 
taken from the same patient. Samples were used from colon tumors. Using reverse 
transcriptase, cDNAs were synthesized from these RNAs. Radiolabeled cDNA was 
synthesized using Strip-EZ™ kit (Ambion, Austin, TX) according to the manufacturer's 

20 instructions. These labeled, amplified cDNAs were then used as a probe, to hybridize to 
human phosphatase arrays comprising human MKPX, PTP4A1, PTPN7, FEM-2, 
DKFZP566K0524 and FLJ20313 sequences. The amount of radiolabeled probe hybridized 
to each arrayed EST clone was detected using phosphorimaging. The expression of these 
genes was substantially upregulated in at least one of the tumor tissues tested. Samples are 

25 taken from the colon, prostate, breast, kidney, uterine, kidney, stomach, bladder, leukemia, 
cervical tumors, using dot blots or RT-PCR, expression of MKPX. PTP4A1, PTPN7. FEM-2, 
DKFZP566K0524 and FLJ20313 is examined. 

Example 3 

30 Antisense regulation of MKPX PTP4A1 . PTP N7. FEM-2. DKFZP566K0524 or FLJ20313 

expression 

Additional functional information on MKPX, PTP4A1, PTPN7, FEM-2, 
DKFZP566K0524 or FLJ20313 is generated using antisense knockout technology. MKPX, 
35 PTP4A1, PTPN7, FEM-2, DKFZP566K0524 or FLJ20313 expression in cancerous cells is 
further analyzed to confirm the role and function of the gene product in tumorgenesis, e.g., in 
promoting a metastatic phenotype. 
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A number of different oligonucleotides complementary to MKPX, PTP4A1, PTPN7, 
FEM-2, DKFZP566K0524 or FLJ20313 mRNA are designed as potential antisense 
oligonucleotides, and tested for their ability to suppress expression of one of the peptides of 
the Invention.. The ability of each designed antisense oligonucleotide to inhibit gene 
expression is tested through transfection into SW620 colon colorectal carcinoma cells, or 
cells from any other cell lines such as-A54S-(Lung carcinoma), B16-F1 (Melanoma), DLD?1 
(Colon carcinoma), LS-180 (Colon carcinoma), PC3 (Prostate carcinoma), U87 (Glioma), 
MCF-7 (Mammary carcinoma), Huvec (normal human endothelial). Hs-27 (normal lung 
fibroblast) and MCF-10a (Mammary epithelial). For each transfection mixture, a carrier 
molecule, preferably a lipitoid or cholesteroid, is prepared to a working concentration of 0.5 
mM in water, sonicated to yield a uniform solution, and filtered through a 0.45 urn PVDF 
membrane. The antisense or control oligonucleotide is then prepared to a working 
concentration of 100 uM in sterile Millipore water. The oligonucleotide is further diluted in 
OptiMEM™ (Gibco/BRL), in a microfuge tube, to 2 uM, or approximately 20 ug oligo/ml of 
OptiMEM™. In a separate microfuge tube, lipitoid or cholesteroid, typically in the amount of 
about 1.5-2 nmol lipitoid/ug antisense oligonucleotide, is diluted into the same volume of 
OptiMEM™ used to dilute the oligonucleotide. The diluted antisense oligonucleotide is 
immediately added to the diluted lipitoid and mixed by pipetting up and down. 
Oligonucleotide is added to the cells to a final concentration of 30 nM. The level of target 
mRNA in the transfected cells is quantitated in the cancer cell lines using the Roche 
LightCycler™ real-time PCR machine. Values for the target mRNA is normalized versus an 
internal control (e.g., beta-actin). 

The antisense oligonucleotides are introduced into a test cell and the effect upon 
MKPX, PTP4A1, PTPN7, FEM-2. DKFZP566K0524 or FLJ20313 expression, as well as the 
effect upon induction of the cancerous phenotype, is examined as described below. 

Example 4 

Ffferts of MKPX. PTP4A1- PTPN7. FEM-2. DKFZP5 66K0524 ORFLJ20313 antisense 

polynucleotides on cell proliferation 

The effect of MKPX, PTP4A1, PTPN7. FEM-2, DKFZP566K0524 or FLJ20313 
antisense polynucleotides.on proliferation is assessed in the cancer cell lines listed above. 
Transfection is carried out as described above in Example 4, except the final concentration of 
oligonucleotide for all experiments is 300 nM, and the final ratio of oligo to delivery vehicle for 
all experiments is 1.5 nmol lipitoid/ug oligonucleotide. Cells are transfected overnight at 
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37°C and the transfection mixture is replaced with fresh medium the next morning. 
Proliferation is measured visually and the effects of antisense polynucleotides on cell 
proliferation are determined. 

5 Example 5 

FffprtR of MKPX. PTP4A1 . PTPN7- FEM-2. DKFZ P566K0524 OR FL.I20313 antisense 

polynucleotides on colony formation 

The effect of MKPX, PTP4A1 , PTPN7, FEM-2. DKFZP566K0524 or FLJ20313 
10 antisense polynucleotides on colony formation is tested in a soft agar assay. Soft agar 
assays are conducted by first establishing a bottom layer of 2 ml of 0.6% agar in media 
plated fresh within a few hours of layering on the cells. The cell layer is formed on the 
bottom layer by removing cells transfected as described above from plates using 0.05% 
trypsin and washing twice in media. The cells are counted in a Coulter counter, and 
15 resuspended to 10 6 per ml in media. 10 ul aliquots are placed with media in 96-well plates, 
or diluted further for soft agar assay. Cells are plated in 0.4% agar in duplicate wells above 
0.6% agar bottom layer. After the cell layer agar solidifies, 2 ml of media is dribbled on top 
and antisense or reverse control oligo is added without delivery vehicles. Colonies are 
formed in 10 days to 3 weeks. Fields of colonies are counted by eye and the effects of 
20 antisense polynucleotides on colony formation can be determined. 

Example 6 

Induction of cell death upon depletion of M KPX. PTP4A1 . PTPN7. FEM-2, DKFZP566K0524 

or FLJ20313 

25 . . . 

Cells are transfected as described for proliferation assays. Each day, cytotoxicity is 

monitored by measuring the amount of LDH enzyme released in the medium due to 
membrane damage. The activity of LDH is measured using the Cytotoxicity Detection Kit 
from Roche Molecular Biochemicals. The data is provided as a ratio of LDH released in the 
30 medium vs. the total LDH present in the well at the same time point and treatment 
(rLDH/tLDH). 

Example 7 

Assay for agents that modulate MKPX. PTP4A1 . P TPN7. FEM-2. DKFZP566K0524 or 
35 FLJ20313 activity 



MKPX, PTP4A1, PTPN7, FEM-2, DKFZP566K0524 or FLJ20313 is expressed as a 
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6x His tag fusion protein using the baculovirus system, purified using affinity 
chromatography, and phosphatase assays are performed as described in Ausubel et al 1999 
(Short protocols in molecular biology; John Wiley and Sons, NY). 

Agents modulating MKPX, PTP4A1, PTPN7, FEM-2, DKFZP566K0524 or FLJ20313 
activity can be identified by comparing the activity of one of the phosphatases in the 
presence of a candidate agent to the activity of the same phosphatase in the absence ota 
candidate agent. 
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WHAT IS CLAIMED IS: 

1. A method of screening for biologically active agents that modulate a cancer 
associated phosphatase function, the method comprising: combining a candidate biologically 
active agent with any one of: 

(a) a polypeptide encoded by SEQ ID NOS:1, 3, 5, 7, 9 or 11; or having the amino acid 
sequence set forth in SEQ ID NOS:2 f 4, 6, 8, 1 0~or 1 2;~ 

(b) a cell comprising a nucleic acid encoding a polypeptide encoded by SEQ ID NOS:1, 3, 
5, 7, 9 or 11; or 

(c) a non-human transgenic animal model for cancer associated phosphatase gene 
function comprising one of: (i) a knockout of a gene corresponding to SEQ ID NOS:1 , 3, 5, 7, 9 or 
1 1 ; (ii) an exogenous and stably transmitted mammalian gene sequence comprising polypeptide 
encoded by SEQ ID NOS:1 t 3, 5, 7, 9 or 11; and 

determining the effect of said agent on phosphatase function. 

2. A method for the diagnosis of cancer, the method comprising: 
determining the upregulation of expression in SEQ ID NOS:1, 3, 5, 7, 9 or 11 in said cancer. 

3. The method of Claim 2, wherein said cancer is a liver cancer. 

4. The method of Claim 2, wherein said cancer is a colon cancer. 

5. The method of Claim 2, wherein said determining comprises detecting the 
presence of increased amounts of mRNA in said cancer. 

6. The method of Claim 2, wherein said determining comprises detecting the 
presence of increased amounts of protein in said cancer. 

7. A method for inhibiting the growth of a cancer cell, the method comprising 
downregulating activity of the polypeptide encoded by SEQ ID NOS:1, 3, 5, 7, 9 or 11; or having 
the amino acid sequence set forth in SEQ ID NOS:2, 4, 6, 8, 10 or 12; in said cancer cell. 

8. The method according to Claim 7, wherein said method comprises introducing 
antisense sequences specific for SEQ ID NOS:1, 3, 5, 7, 9 or 11. 
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9. The method according to Claim 7, wherein said method comprises introducing an 
inhibitor of phosphatase activity into said cancer cell. 

1 0. The method according to Claim 7, wherein said cancer cell is a liver cancer cell. 

1 1 . The method according to Claim 7, wherein said cancer cell is a colon cancer cell. 

12. A method of screening for targets of a cancer associated phosphatase, wherein 
said targets are associated with signal transduction in cancer cells, the method comprising: 

comparing the pattern of gene expression in a normal cell, and in a tumor cell 
characterized by up-regulation of SEQ ID NOS:1, 3, 5, 7, 9 or 1 1. 

13. The method according to Claim 12, wherein said comparing the pattern of gene 
expression comprises quantitating specific mRNAs by hybridization to an array of polynucleotide 
probes. 

14. A method of screening for targets of a cancer associated phosphatase, wherein 
said targets are associated with signal transduction in cancer cells, the method comprising: 

comparing the pattern of protein phosphorylation in a normal cell, and in a tumor cell 
characterized by up-regulation of SEQ ID NOS:1 , 3, 5, 7, 9 or 1 1 . 

15. The method according to claim 12 or claim 14, wherein said signal transduction 
involves activation MKPX, PTP4A1 , PTPN7, FEM-2, DKFZP566K0524 or FLJ20313 . 

16. An isolated nucleic acid comprising the sequence set forth in SEQ ID NOS:1, 3, 5, 
7, 9 or 11. 

17. A method to treat a tumor comprising administering a therapeutic amount of a 
composition comprising: 

a compound of the general formula <x(P z )C, wherein cc(P z ) is one or more moieties which 
specifically binds to a human protein MKPX, PTP4A1, PTPN7, FEM-2, DKFZP566K0524 or 
FLJ20313, and C is one or more cytotoxic moieties; 
and a pharmaceutically acceptable carrier. 



45 



WO 03/083102 




PCT/CA03/00393 



18. The method of claim 17 wherein the therapeutic composition is administered by 
intravascular administration. 

19. The method of claim 17 wherein the tumor is a colon, pancreas, lung or ovarian 

tumor. 

20. The method of claim 17 wherein cc(Pz) is selected from the group consisting of an 
antibody and an antibody fragment. 

21 . The method of claim 20 wherein the antibody is selected from the group consisting 
of monoclonal antibodies, polyclonal antibodies, humanized antibodies, recombinant antibodies, 
chemically modified antibodies, and synthetic antibody analogs. 

22. The method of claim 17 wherein C is a radioactive moiety. 

23. The method of claim 17 wherein the radioactive moiety comprises a 
pharmaceutically acceptable radioactive isotope selected from the group consisting of 123 l, 125 l, 

131, Oy 211 At 67 Cu 186 Re 1B8 Re 212 pb and 212 Bj 

24. The method of claim 17 wherein C is a chemotoxic moiety. 

25. The method of claim 24 wherein the chemotoxic moiety is selected from the group 
consisting of methotrexate, a pyrimidine analog, a purine analog, a phorbol ester, and butyric 
acid. 

26. The method of claim 17 wherein C is a toxin protein moiety. 

27. The method of claim 26 wherein the toxin protein moiety is selected from the 
group consisting of ricin, abrin, diphtheria toxin, cholera toxin, gelonin, Pseudomonas exotoxin, 
Shigella toxin, and pokeweed antiviral protein. 

28. A compound for the treatment of a tumor of the general formula ct(P z )C, wherein 
<x(P z ) is one or more moieties which specifically binds to human MKPX, PTP4A1, PTPN7, FEM-2, 
DKFZP566K0524 or FLJ20313 protein, and C is one or more cytotoxic moieties. 

29. The compound of claim 28 wherein a(Pj) is selected from the group consisting of 
an antibody and an antibody fragment. 
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30. The compound of claim 29 wherein the antibody is selected from the group 
consisting of monoclonal antibodies, polyclonal antibodies, humanized antibodies, recombinant 
antibodies, chemically modified antibodies, and synthetic antibody analogs. 

31 . The compound of claim 28 wherein C is a radioactive moiety. 

32. The compound of claim 31 wherein the radioactive moiety comprises a 
pharmaceutical^ acceptable radioactive isotope selected from the group consisting of 123 l, 125 l, 

131, 80 Yf 211 At 87 CUf 186 Re 188 Re 212 pb an(j 212 Bj 

33. The compound of claim 28 wherein C is a chemotoxic moiety. 

34. The compound of claim 33 wherein the chemotoxic moiety is selected from the 
group consisting of methotrexate, a pyrimidine analog, a purine analog, a phorbol ester, and 
butyric acid. 

35. The compound of claim 28 wherein C is a toxin protein moiety. 

36. The compound of claim 35 wherein the toxin protein moiety is selected from the 
group consisting of ricin, abrin, diphtheria toxin, cholera toxin, gelonin. Pseudomonas exotoxin. 
Shigella toxin, and pokeweed antiviral protein. 

37. A method for treating a tumor comprising administering a therapeutic amount of a 
composition comprising: a compound of the general formula a(Pz), wherein a(Pz) is one or more 
moieties which specifically binds to a human protein MKPX, PTP4A1, PTPN7, FEM-2, 
DKFZP566K0524 or FLJ20313 , wherein the binding of <x(Pz) alters the function of the human 
protein, and a pharmaceutically acceptable carrier. 

38. The method of claim 37 wherein the therapeutic composition is administered by 
intravascular administration. 

39. The method of claim 37 wherein the tumor is a colon, pancreas, lung or ovarian 

tumor. 

40. The method of claim 37 wherein a(P z ) is selected from the group consisting of an 
antibody and an antibody fragment. 

41 . A composition for the treatment of a tumor comprising: a compound of the general 
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formula a(P z ), wherein a(Pz) is one or more moieties which specifically binds to a human MKPX, 
PTP4A1, PTPN7, FEM-2, DKFZP566K0524 or FLJ20313 , wherein the binding of a(Pz) alters the 
function of the protein, and a pharmaceutically acceptable earner. 

42. The composition of claim 41 wherein ct(P z ) is selected from the group consisting of 
an antibody and an antibody fragment. 

43. A method for visualizing a tumor in a patient, the method comprising: a) 
administering to a patient an effective amount of a composition comprising: a compound of the 
general formula <x(P z )l, wherein a(Pz) is one or more moieties which specifically binds to a human 
MKPX, PTP4A1, PTPN7, FEM-2, DKFZP566K0524 or FLJ20313 protein, and I is one or more 
imaging moieties; and a pharmaceutically acceptable carrier; and b) visualizing the imaging 
moieties of the compound. 

44. The method of claim 43 wherein the imaging composition is administered by 
intravascular administration. 

45. The method of claim 43 wherein the tumor is a colon, pancreas, lung or ovarian 

tumor. 

46. The method of claim 43 wherein a(Pz) is selected from the group consisting of an 
antibody and an antibody fragment. 

47. The method of claim 43 wherein I is a radiographic moiety. 

48. The method of claim 43 wherein the radiographic moiety comprises iodine or an 
iodine isotope. 

49. The method of claim 43 wherein the visualizing step (b) comprises x-ray imaging. 

50. The method of claim 43 wherein the visualizating step (b) comprises scintillation 
imaging. 

51 . The method of claim 43 wherein I is a positron-emitting moiety. 

52. The method of claim 43 wherein the positron-emitting moiety comprises 18 F. 

53. The method of claim 43 wherein the visualizating step (b) comprises positron 
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emission tomography. 

54. The method of claim 43 wherein I is a magnetic spin contrast moiety. 

55. The method of claim 54 wherein the magnetic spin contrast moiety comprises an 
ion selected from the group consisting of chromium(lll), manganese(ll), iron(ll). nickel(ll), 
copper(ll), praseodymium(lll), neodymium(lll), samarium(lll) and ytterbium(lll). 

56. The method of claim 43 wherein the visualizing step (b) comprises magnetic 
resonance imaging. 

57. The method of claim 43 wherein I is selected from the group consisting of an 
optically visible dye and an optically visible particle. 

58. The method of claim 43 wherein the visualizing step (b) comprises direct visual 
inspection. 

59. The method of claim 43 wherein the visualizing in step (b) comprises visual 
inspection through an endoscopic instrument. 
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SEQUENCE LISTING 
<110> Delaney, Allen 

<120> Cancer Associated Protein Phosphatases and their 
uses 

<130> KINE-040PRV 

<140> unknown 
<141> 

<160> 12 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 1520 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> (0) . ■ . (0) 

<223> MKPX polynucleotide 

<400> 1 

ggcacgaggc cgagcctagt 
gcggagtggg gcgcggcagc 
gcgaggaggg acgctgggcc 
catttcccgg cataggctcc 
cgcacagaaa gccctgccct 
tccctgccgc tgacttgttg 
gcccgccccg ccgctcctcc 
ggggcgctag cgttcgcctt 
cctgtacatc ggcaacttca 
gacacatatt ctgtctgtcc 
gtgcatccca gcagcggatt 
taaattcatt cacgagtgcc 
ggtctccagg agcgtgacac 
ggaggatgcc ctgcacaccg 
ccagagacag ctccaggagt 
ggaagaatat ggagagagcc 
tccaggaatt ctgaagttct 
aaatattgca aacccacaga 
ttaagtatcc agtagtgatt 
tcatgtttat gttgagaact 
cccactgctg tggaggtttc 
gccgcactgc cttgtgggtg 
gacgagccca gtgtagttgt 
ttgggaaata gcccttggtg 
accgaaagct ctatgttttc 
aaaaaaaaaa aaaaaaaaaa 



gcctcccacg cccggcggcc gcgagccggg gtccgcgagg 60 

caggaacccg actacgaatc ccagggtgcg ggcgggcgga 120 

tgcccggtgc gcacgggggc ggggaccggc aaggcgggac 180 

ggtgcccctg cccggctccc gccgggaagt tctaggccgc 240 

ccacgccggg tctctggagc gccctgggtt gcccggccgg 3 00 

acactgcgag cactcagtcc ctcccgcgcg cctcctcccc 360 

tccctgtaac atgccatagt gcgcctgcga ccacacggcc 42 0 

cagccaccat ggggaatggg atgaacaaga tcctgcccgg 4 80 

aagatgccag agacgcggaa caattgagca agaacaaggt 540 

atgatagtgc caggcctatg ttggagggag ttaaatacct 600 

caccatctca aaacctgaca agacatttca aagaaagtat 660 

ggctccgcgg tgagagctgc cttgtacact gcctggccgg 72 0 

tggtgatcgc atacatcatg accgtcactg actttggctg 780 

tgcgtgctgg gagatcctgt gccaacccca acgtgggctt 840 

ttgagaagca tgaggtccat cagtatcggc agtggctgaa 900 

ctttgcagga tgcagaagaa gccaaaaaca ttctggccgc 960 

gggcctttct cagaagactg taatgtacct gaagtttctg 1020 

gtttaggctg gtgctgccaa aaagaaaagc aacatagagt 10 80 

tgtaaacttg tttttcattt gaagctgaat atatacgtag 1140 

aaggatattc tttagcaaga gaaaatattt tccccttatc 1200 

tgtacctcgc ttggatgcct gtaaggatcc cgggagcctt 12 60 

gcttggcgct cgtgattgct tcctgtgaac gcctcccaag 1320 

gtggcgtgaa ctctgcccgt gtgttctcaa attccccagc 13 80 

tgggttttat ctctggtttg tgttctccgt ggtggaattg 1440 

gttaataaag ggcaacttag ccaagtttaa aaaaaaaaaa 1500 

1520 



<210> 2 
<211> 184 
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<212> PRT 

<213> Homo sapiens 

<220> 

<221> UNSURE 

<222> (0) . . . (0) 

<223> MKPX polypeptide 



<400> 2 














Met 


Gly 


Asn 


Gly 


Met 


Asn 


Lys 


He 


1 








5 








Phe 


Lys 


Asp 


Ala 


Arg 


Asp 


Ala 


Glu 








20 










His 


lie 


Leu 


Ser 


Val 


His 


Asp 


Ser 






35 










40 


Lys 


Tyr 


Leu 


Cys 


He 


Pro 


Ala 


Ala 




50 










55 




Arg 


His 


Phe 


Lys 


Glu 


Ser 


He 


Lys 


65 










70 






Gly 


Glu 


Ser 


Cys 


Leu 


Val 


His 


Cys 










85 








Thr 


Leu 


Val 


He 


Ala 


Tyr 


He 


Met 








100 










Asp 


Ala 


Leu 


His 


Thr 


Val 


Arg Ala 




115 










120 


Val 


Gly 


Phe 


Gin 


Arg 


Gin 


Leu 


Gin 




13 0 










135 




Gin 


Tyr 


Arg 


Gin 


Trp 


Leu 


Lys 


Glu 


145 










150 






Asp 


Ala 


Glu 


Glu 


Ala 


Lys 


Asn 


He 








165 








Phe 


Trp 


Ala 


Phe 


Leu Arg Arg 


Leu 








180 











Leu 


Pro 


Gly Leu Tyr 


lie 


Gly 


Asn 




10 








15 




Gin 


Leu 


Ser Lys 


Asn 


Lys 


Val 


Thr 


25 








30 






Ala Arg 


Pro Met 


Leu 


Glu 


Gly 


Val 








45 








Asp 


Ser 


Pro Ser 


Gin 


Asn 


Leu 


Thr 






60 










Phe 


He 


His Glu 


Cys Arg 


Leu 


Arg 






75 








80 


Leu 


Ala 


Gly Val 


Ser Arg 


Ser 


Val 




90 








95 




Thr 


Val 


Thr Asp 


Phe 


Gly 


Trp 


Glu 


105 








110 






Gly Arg 


Ser Cys 


Ala 


Asn 


Pro 


Asn 








125 








Glu 


Phe 


Glu Lys 


His 


Glu 


Val 


His 






140 










Glu Tyr 


Gly Glu 


Ser 


Pro 


Leu 


Gin 






155 








160 


Leu 


Ala 


Ala Pro 


Gly 


He 


Leu 


Lys 




170 








175 





<210> 3 

<211> 2916 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<222> (0) . . • (0) 

<223> PTP4A1 polynucleotide 

<400> 3 

aagggcgcct cggcgcgtgt attggctcct 
tgctccgagc cgctcactgc atggtagagt 
ccaccgccgc tccgccacga ccaccgccgc 
gtgtcgccgc cgctcgggac cggctgtatg 
attcctcaat tctgtggtgt tcttggtcac 
gattactgcc aggcacagca cgacctctat 
tgctccacca agaagccccc ataagagtgg 
cagagcattt tacaagagtt ctgacctgga 
ttggcctcag tattactgga ttgaagaatt 
tatcattact tacaacttca tactcaaagc 
tagacttcag tttctttgga tcatttctgt 



tcggctgcgg gccggctcgc ctacgcgctc 60 
ctggtgcccc cgccgccgcc tgcatcgccg 120 
ctccttgtcc tgcagccacc gccaccgcct 180 
attaggccac aatcttcaat gagtaaacat 240 
acatttatgg agtttctgaa gggcagtgga 3 00 
gcagacaagt gaactgtaga aactgattac 360 
ttatcctgga cacagaagtg ttgaaatcca 420 
tggggtaaac ctcagtgcac ttcttttctg 480 
gctgcttctt gttaggaggt tcatttcact 540 
actgagaatt tcaagtggag tatattgaag 600 
attcaatttt tttaattatt tcataaccct 660 
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attgagtgtt ttttaactaa attaacatgg ctcgaatgaa ccgcccagct cctgtggaag 720 
tcacatacaa gaacatgaga tttcttatta cacacaatcc aaccaatgcg accttaaaca 780 
aatttataga ggaacttaag aagtatggag ttaccacaat agtaagagta tgtgaagcaa 840 
cttatgacac tactcttgtg gagaaagaag gtatccatgt tcttgattgg ccttttgatg 900 
atggtgcacc accatccaac cagattgttg atgactggtt aagtcttgtg aaaattaagt 960 
ttcgtgaaga acctggttgt tgtattgctg ttcattgcgt tgcaggcctt gggagagctc 1020 
cagtacttgt tgccctagca ttaattgaag gtggaatgaa atacgaagat gcagtacaat 1080 
tcataagaca aaagcggcgt ggagctttta acagcaagca acttctgtat ttggagaagt 1140 
atcgtcctaa aatgcggctg cgtttcaaag attccaacgg tcatagaaac aactgttgca 1200 
ttcaataaaa ttggggtgcc taatgctact ggaagtggaa cttgagatag ggcctaattt 1260 
gttatacata ttagccaaca tgttggctta gtaagtctaa tgaagcttcc ataggagtat 1320 
tgaaaggcag ttttaccagg cctcaagcta gacagatttg gcaacctctg tatttgggtt 1380 
acagtcaacc tatttggata cttggcaaaa gattcttgct gtcagcatat aaaatgtgct 1440 
tgtcatttgt atcaattgac ctttccccaa atcatgcagt attgagttat gacttgttaa 1500 
atctattccc atgccagaat cttatcaata cataagaaat ttaggaagat taggtgccaa 1560 
aatacccagc acaatacttg tatattttta gtaccataca gaagtaaaat cccaggaact 1620 
atgaacacta gaccttatgt ggtttattcc ttcagtcatt tcaaacattg aaagtagggc 1680 
ctacatggtt atttggctgc tcactttatg tttacatctc ccacattcat accaatatac 1740 
gtcaggtttg gttaaccatt gatttttttt tttttttacc aagtcttaca gtgattattt 1800 
tacgtgtttc catgtatctc actttgtgct gtattaaaaa aacctccatt ttgaaaatct 1860 
acgttgtaca gaagcacatg tctttaatgt cttcagacaa aaaagcctta cattaattta 1920 
atgtttgcac tctgaggtgc aacttaacag ggagggcctg agaaaagaat gggagggggc 1980 
tattaattat ttttagcaaa atgttgcctt tgtcttgtgc aaacatgtag aatatgctct 2040 
ttaatttagt aaaatatttt tttaaaaggt agagatgctt tgttattgta atcataaact 2100 
tcctgaaatt cttgtaattt ttttcccata cttatcagaa gtgtgtttac caacttattc 2160 
ttgtttgaaa gtgtgatttt ttttttcctt cccaacctct cttgcaaaaa aagaaatggg 2220 
tttctgctaa tgaattgagc agacatctaa tattttatat gccttttgga gctgggtaac 22 80 
ttaatatttg gatacttgac aatttgtttt attatgtaat tgataaaatg gtgatgtgta 2340 
ttaatgttag ttcaaccata tatttatact gtctggggat gtgtggttat agttctgtgg 2400 
gagaaataat tttgtcagtg ttcaccagct tgtaaaaact tagtgcgaga gctgaaacat 2460 
ctaaataaat aatgacatgc atttatcatc attgagattg gtttgcttaa aattaactta 2520 
ttttgtagaa gacaaaatga attgcacttc acttaatgtg tgtcctcatc tttttacaaa 25 80 
taaatgaagg attataaatg atgtcagcat tttagtaaac ttatagacaa aatttgttag 2640 
ggtcattcat gaaaacttta atactaaaag cactttccat tatatacttt ttaaaggtct 2700 
agataatttt gaaccaattt attattgtgt actgaggaga aataatgtat agtagaggac 2760 
agccttggtt tgtaaagctc agctccacta gttcatggtt tggtgcaact tctgagcctc 2 820 
agttctctcc tttgcaaatt aataattaca tacctgccta gatttcggaa attaatctaa 2880 
atattagtat ctggctacat gatggccatg tcaagt 2916 



<210> 4 

<211> 173 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> (0) . . . (0) 

<223> PTP4A1 polypeptide sequence 
<400> 4 

Met ala Arg Met Asn Arg Pro Ala Pro Val Glu Val Thr Tyr Lys Asn 

15 10 15 

Met Arg Phe Leu He Thr His Asn Pro Thr Asn Ala Thr Leu Asn Lys 

20 25 30 

Phe He Glu Glu Leu Lys Lys Tyr Gly Val Thr Thr He Val Arg Val 
35 40 45 
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Val 
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Leu 
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Lys 
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Leu Val Glu Lvs 


Glu 


Gly He 


His 


60 








Gly Ala Pro Pro 


Ser 


Asn Gin 


He 
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Lys He Lys Phe 


Arg 


Glu Glu 
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90 
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Val Ala Glv Leu 


Gly 


Arg Ala 


Pro 
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Giu Gly Gly Met 


Lys 
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Arg Arg Gly Ala 
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Asn Ser 
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Arg Pro Lys Met 


Arg 
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Asn Cys Cys He 
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<210> 5 

<211> 2759 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> (0) . . . (0) 

<223> PTPN7 polynucleotide sequence 
<400> 5 

ggcacgaggc aagaggcagc ctgggggcca cagctgcttc agcagacctc atggctgagt 60 
gagcctcccc tgggcccagc accccacctc agcatggtcc aagccatggg gggcgctcca 120 
gagcacagcc gttgaccttg tctttggggg cagccatgac ccagcctccg cctgaaaaaa 180 
cgccagccaa gaagcatgtg cgactgcagg agaggcgggg ctccaatgtg gctctgatgc 240 
tggacgttcg gtccctgggg gccgtagaac ccatctgctc tgtgaacaca ccccgggagg 300 
tcaccctaca ctttctgcgc actgctggac acccccttac ccgctgggcc cttcagcgcc 360 
agccacccag ccccaagcaa ctggaagaag aattcttgaa gatcccttca aactttgtca 420 
gccccgaaga cctggacatc cctggccacg cctccaagga ccgatacaag accatcttgc 4 80 
caaatcccca gagccgtgtc tgtctaggcc gggcacagag ccaggaggac ggagattaca 540 
tcaatgccaa ctacatccga ggctatgacg ggaaggagaa ggtctacatt gccacccagg 600 
gccccatgcc caacactgtg tcggacttct gggagatggt gtggcaagag gaagtgtccc 660 
tcattgtcat gctcactcag ctccgagagg gcaaggagaa atgtgtccac tactggccca 720 
cagaagagga aacctatgga cccttccaga tccgcatcca ggacatgaaa gagtgcccag 780 
aatacactgt gcggcacgtc accatccagt accaggaaga gcgccggtca gtaaagcaca 840 
tcctcttttc ggcctggcca gaccatcaga caccagaatc agctgggccc ctgctgcgcc 900 
tagtggcaga ggtggaggag agcccggaga cagccgccca ccccgggcct atcgtagtcc 960 
actgcagtgc agggattggc cggacgggct gcttcatcgc cacgcgaatt ggctgtcaac 1020 
agctgaaagc ccgaggagaa gtggacattc tgggtattgt gtgccaactg cggctagaca 1080 
gaggggggat gatccagacg gcagagcagt accagttcct gcaccacact ttggccctgt 1140 
atgcaggcca gctgcctgag gaacccagcc cctgacccct gccaccctcc ggtggcccag 1200 
gtgcctacct ccctcaagcc tgggaaggtg ggtctgggga aagtgggccg agtgatctgg 1260 
gggtaccctt gggttggtgt ggggaaggag tgcctcctta gtggtgcttg acagtcacag 1320 
gaagcagcag cagtaaggac aaggggccgg attcaggtct tcaaccactg gccactcctc 1380 
ttgccttcct ctgttggccc cagatggaca gtaaggggaa cctccaatgt ctctctgaac 1440 
ttaaagacag gagctggcat ttatgacaga caaagaaaga agcccaggtg tcctggtgtt 1500 
ctctgagaca ctctttgtga tcttcagttt cctgttctat aacatgaaca taagtgctta 1560 
gctgccatga gggaaaagta atgagagaag ttctagaagc cactccagcc actccttcct 1620 
ggggctgaca aaagggtgat tccaagatca tccttcaccc gaggtcctgc ccaagcacag 1680 
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gccagatgca agaatgggga aaagtctggt cctgatctcc aagtctcaac atcctatcag 1740 
tgactctgcc tccctgacca cacatcggaa gggcctggat gacccaatca aaagaaagaa 1800 
caaggactct ggttaccctt gcctccaccc atgtgtcata agagtaggct acagaggtga 1860 
ccaggcctgg cagttgaaat ctctggaaga gggaacatgt ggggactact cagaggcaaa 1920 
gaggagctgc tcctgcctcc atggttgctg gccactccca ccaactactc ttagggaggc 1980 
taagcagtct ctgttttgac cttccatggc tcaataatac ctggatgcag gaccactata 2040 
ccttgcattt gctgagtaca cctagagagc ttggctgttt ccaaaaacaa tcagggtcat 2100 
aaccatccat gcagacatgg aggctcggct gaaccaggac tcctcactgt ctacctgaga 2160 
gaatgagcac ccctcatcca tctcagcatc aacacaattt ccaggggacc tcaggtctac 2220 
ctcaggactg aaccgccaca cctcaggatt cctcctcctt gaatctgaga ctggctgccc 2280 
attctgagat ggggatgaag gtaagatgcc gcatcaccag cacgccgccc ctgacagctg 2340 
ccttgatacc agctctctgt ggaaaccccc gaggagttgg atctggagaa cagctgggcc 2400 
tcctcactca ggacttctct cctgaagaac acgcagtgct aaaactgagg atgatttccc 2460 
taatgcttct gcttggagtc tcttatggag gagctgctcc ttccttacag cttggggatg 2520 
gacttcccac acctccacct cccctgagcc ctgagccctg tgagaggacg actgtctatg 2580 
caatgaggct cggtgggggg ctctcaagtg cctgatcctg cctggctcag aggcagccag 2640 
agggaagcaa ctgacagccc cacaggccct ccctggcact gtccccatct cagagctcag 2700 
gagggtacaa gctccagaac agtaaccaag tgggaaaata aagacttctt ggatgactg 2759 



<210> 6 

<211> 339 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> UNSURE 
<222> (0) . . . (0) 

<223> PTPN7 polypeptide sequence 



<400> 6 












Lys His Val 




Met Thr 


Gin Pro 


Pro 


Pro 


Glu 


Lys Thr Pro Ala Lys 


Arg 


1 




5 






10 


15 




Leu Gin 


Glu Arg 


Arg 


Gly 


Ser 


Asn Val Ala Leu Met 


Leu Asp Val 


Arg 




20 








25 


30 




Ser Leu 


Gly Ala 


Val 


Glu 


Pro 


He Cys Ser Val Asn 


Thr Pro Arg 


Glu 




35 








40 


45 




Val Thr 


Leu His 


Phe 


Leu 


Arg 


Thr Ala Gly His Pro 


Leu Thr Arg 


Trp 


50 








55 


60 






Ala Leu 


Gin Arg 


Gin 


Pro 


Pro 


Ser Pro Lys Gin Leu 


Glu Glu Glu 


Phe 


65 




70 




75 


Leu Asp He 


80 


Leu Lys 


lie Pro 


Ser 


Asn 


Phe 


Val Ser Pro Glu Asp 


Pro 




85 






90 


95 




Gly His 


Ala Ser 


Lys 


Asp 


Arg 


Tyr Lys Thr He Leu 


Pro Asn Pro 


Gin 


100 








105 


110 




Ser Arg 


Val Cys 


Leu 


Gly 


Arg 


Ala Gin Ser Gin Glu Asp Gly Asp Tyr 


115 








120 


125 




lie Asn 


Ala Asn 


Tyr 


lie 


Arg 


Gly Tyr Asp Gly Lys 


Glu Lys Val 


Tyr 


130 






135 


140 




Glu 


lie Ala 


Thr Gin 


Gly 


Pro 


Met 


Pro Asn Thr Val Ser Asp Phe Trp 


145 




150 




155 


Leu Thr Gin 


160 


Met Val 


Trp Gin 


Glu 


Glu 


Val 


Ser Leu He Val Met 


Leu 




165 






170 


175 


Glu 


Arg Glu 


Gly Lys 


Glu 


Lys 


Cys 


Val His Tyr Trp Pro 


Thr Glu Glu 


180 








185 


190 




Thr Tyr 


Gly Pro 


Phe 


Gin 


lie 


Arg He Gin Asp Met 


Lys Glu Cys 


Pro 


195 








200 


205 
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Glu 


Tyr 


Thr Val 


Arg 


His 


Val Thr 




210 








215 


Ser 


Val 


Lys His 


He 


Leu 


Phe Ser 


225 








230 




Glu 


Ser Ala Gly 


Pro 


Leu 


Leu Arg 








245 






Pro 


Glu 


Thr Ala 


Ala 


His 


Pro Gly 






260 








Gly 


He 


Gly Arg 


Thr 


Gly 


Cys Phe 




275 






280 


Gin 


Leu 


Lys Ala 


Arg 


Gly 


Glu Val 




290 








295 


Leu 


Arg 


Leu Asp 


Arg 


Gly 


Gly Met 


305 








310 




Phe 


Leu 


His His 


Thr 


Leu 


Ala Leu 








325 






Pro 


Ser 


Pro 









He 


Gin 


Tyr Gin Glu 


Glu Arg 


Arg 






220 








Ala Trp 


Pro Asp His 


Gin 


Thr 


Pro 






235 






240 


Leu 


Val 


Ala Glu Val 


Glu 


Glu 


ser 




250 






255 




Pro 


He 


Val Val His 


Cys 


Ser 


Ala 


265 






270 






He 


Ala 


Thr Arg He 


Gly Cys 


Gin 






285 








Asp 


He 


Leu Gly He 


Val 


Cys 


Gin 






300 








He 


Gin 


Thr Ala Glu 


Gin 


Tyr 


Gin 






315 






320 


Tyr 


Ala 


Gly Gin Leu 


Pro 


Glu 


Glu 


330 






335 





<210> 7 

<211> 3960 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> (0) . . • (0) 

<223> FEM-2 polynucleotide 

<400> 7 

ggacacggag ccgcgaggag acagctgagg 
accctcttgc cctggcctag ctgcaggccc 
cccacagaag agcagcccaa tggccagtgg 
gctcctgcaa gacttcccag ccctgctgaa 
agggacggtg ctcagccagg aggaggtgga 
tctgggcagc aggaaggccc cgccaccact 
acagctgcta cagacagacc tttccgaatt 
ggaggaggac gatgacgagg aggaaaaggc 
ggcacagagt ttctttaacc gcctttggga 
attggctgcc cgggcctcac agcggcagtg 
tcgccgcaag atggaggacc ggcacgtgtc 
gtctgaccct gtgaaccgcg cctactttgc 
tgcgaggtac gccgctgtcc acgtgcacac 
agaccctgag ggagccctca gagaagcctt 
agccaagcga gagcggctgc agagcggcac 
gaccctgcac gtcgcctggc tcggggattc 
ggtgaagctg atggagccac acagaccaga 
attgggtggc tttgtgtctc acatggactg 
cagagccatc ggggatgtct tccagaagcc 
ccgggcgctg acgggctccg aggactacct 
cgtaccccac caggaagttg ttggcctggt 
cgggctccgt gtcgccgagg agctggtggc 
catcacggtc atggtggtct tcctcaggga 
gggagaaggg gacccccagg cagaagggag 
acctgagacc caggctccac caagaagcta 
ctcccatcct tgtccttctc tccctcagaa 



cccgcggaga ccagggggtg aagcctggag 60 
ccgggatgct ttgggcatgt cctctggagc 120 
agctgaggag accccaggct tcctggacac 180 
cccagaggac cctctgccat ggaaggcccc 240 
gggcgagctg gctgagctgg ccatgggctt 300 
tgctgctgct ctggcccacg aagcagtttc 360 
caggaagttg cccagggagg aagaagaaga 420 
ccctgtgacc ttgctggatg cccaaagcct 480 
agtcgccggc cagtggcaga agcaggtgcc 540 
gctggtctcc atccacgcca tccggaacac 600 
cctcccttcc ttcaaccagc tcttcggctt 660 
tgtgtttgat ggtcacggag gcgtggatgc 720 
caacgctgcc cgccagccag agctgcccac 780 
ccggcgcacc gaccagatgt ttctcaggaa 840 
cacaggtgtg tgtgcgctca ttgcaggagc 900 
ccaggtcatt ttggtacagc agggacaggt 960 
acggcaggat gagaaggcgc gcattgaagc 1020 
ctggagagtc aacgggaccc tggccgtctc 1080 
ctacgtgtct ggggaggccg atgcagcttc 1140 
gctgcttgcc tgtgatggct tctttgacgt 1200 
ccagagccac ctgaccaggc agcagggcag 1260 
tgcggcccgg gagcggggct cccacgacaa 1320 
cccccaagag ctgctggagg gcgggaacca 13 80 
gaggcaggac ttgccctcca gccttccaga 1440 
ggtggtttcc aggcccctgc cctccccttc 1500 
gcctcaggac ccaacaggtg gcaggcagtg 1560 
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gacagggtgc 
cgcagcccgt 
cttaggagac 
cggggtggga 
atggggcagc 
agcaccccct 
tctgcggcca 
cacaccagtg 
cccccctgcc 
tgctctcttc 
ttttttccag 
tgccaagatg 
tagaaaactt 
ttattgaatt 
agccattcaa 
cccattgagg 
atctttctgg 
ccccccgctg 
gacagcgggg 
ttgggcctgg 
aatatcttgt 
tcaggcaagc 
tgctgggagt 
cctgccttgc 
cctgagaggg 
gcaaagttga 
ctaggggagg 
tgggcagggc 
aagctgggct 
ccaggaaggt 
ggcacgctag 
ctttgctggg 
gttgagacca 
acatagtcgc 
tgcttggctt 
ctcaccacta 
atctgctttt 
accatgtgga 
ccactggaag 
actgccgctc 



ccgccccaca 
cctggtggct 
ctcaccaaag 
ccagaggtca 
atggttgtgc 
gggtcaggcc 
gggccaggtg 
tcccaaagag 
cccactccag 
cctgagcccg 
ggaggtctaa 
acctgttgtg 
aagtatttac 
tattttctaa 
gtgccttaca 
tttttagaaa 
aagagttttc 
cccaggtgac 
ctctgggcat 
ctggatccca 
gaatttgtaa 
acagaggcag 
ccatggcaca 
tctgccccat 
ggctcacaag 
aagtggcaag 
actcagaggc 
tgctccaggc 
gtgtggtccc 
ttgcacagct 
agccgccttc 
gccctacctg 
tgcagtcaat 
tgtcattcag 
aacttgaagg 
gcccaccctg 
ctcttttctt 
ggtttggctc 
atctcgcagc 
cccacccctg 



gtgctttccc 
gtggaactgc 
agaagatgga 
taggtgccca 
acgtgggacc 
cagcagcgga 
ggttcatatt 
gagagcgcag 
ggcagccatg 
gaaccggcga 
ttccgaagca 
tcatataatt 
ttttttaaag 
atcaggattg 
cagctttgaa 
agcaggattt 
cgcctctttg 
tggcaggagc 
cccgggtgac 
ggcagaggga 
tcaaattggt 
cccgtgggtc 
ggccacagcc 
gccctgaaca 
tgaccgatcc 
gctgggggag 
catctgcagg 
aggtcagagc 
catgaaggca 
gtgccccaag 
ccggtgggag 
catgcccagc 
gcattgctta 
acattcagag 
cctgctgtct 
cccgtgggca 
tgctaataat 
ctatcacctg 
ctcagtgtat 
agccaggacc 



cagcacccca 
actgggtggc 
ccggctcttg 
acggcagcca 
ctggggcgga 
gtggccagcc 
tacagatatg 
cagagccagg 
atgtgcttgg 
cgcacatgtg 
gtattccagg 
taagcagagc 
ctatttttca 
aaacaggaac 
gaaactagga 
gttttgttag 
tgatgctgaa 
tgcgactgcc 
cttggcccat 
ccttgctgct 
ctcattggga 
tctgtctcag 
cctcaccttg 
ggcatgagag 
tgggtgcctc 
ggtgtcgggt 
gccaagccac 
agtgcagggg 
ttcagagtcc 
caccttggcc 
ccctctgtcc 
cttacccctc 
aggtctctta 
aatggttggc 
ccttctgggg 
taaccttgac 
tctgttcctg 
gtggtctgat 
gggaaaccct 
ccagaggatg 



gagccagtcg 
gggcagatgg 
ctcccagctc 
aaccaaagac 
cccaggagcc 
ccagtttccc 
cccagccagt 
ggtctgttct 
cccaccaggg 
tcttttgttg 
ttttctcttt 
ttagcattta 
aggaaccttt 
ttttccaggt 
ctgcagtggg 
ggaggcatga 
cacccccaag 
acgtagtgtt 
ctgcctgcat 
gtgtgattgg 
aagactctta 
tctggaggca 
ccgcggtggc 
ctccacgtcc 
agggagctca 
agagggaaga 
aggaagggct 
gaggagagga 
acctgcagac 
tcctctcagc 
cacagggagc 
attctcacag 
tttacaaaaa 
cacaaacaat 
gtcagggacg 
gaagagagag 
gctgccgaga 
tcatacccta 
ttccccaggc 
gccatgcccc 



ggacaccccc 
tggaaggcag 
ctattaggcc 
actggtgtgc 
aaactcttga 
attgctcctc 
cctggtcggc 
gtagcagcca 
ccttccgggc 
gtgtgtttgt 
gttttatcag 
ttttattctt 
ttttgcagta 
ggtgttaata 
ctcggatagg 
ttttggtgag 
gttctcccct 
gcctgggccc 
tcccaccccc 
aacattccca 
attaagaggc 
gcagggatgc 
tggcagcacg 
cctagtgcac 
ctgagggcgt 
gggcaggggg 
gagctggagg 
gaaagggagg 
agcgagagcc 
tcgccgagga 
ggggagccag 
cacagatgag 
aaaaccttaa 
gaccaagtat 
cagctccacc 
aatgattggc 
gtgaagtttc 
gcctgaggct 
ttgtcccagc 
gtgcctggca 



1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 



<210> 8 
<211> 454 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> (0) . . . (0) 

<223> FEM-2 polypeptide sequence 
<400> 8 

Met Ser Ser Gly Ala Pro Gin Lys Ser Ser Pro Met ala Ser Gly Ala 

1 5 10 15 

Glu Glu Thr Pro Gly Phe Leu Asp Thr Leu Leu Gin Asp Phe Pro Ala 
20 25 30 
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Leu 


Leu 


Asn 
35 


Pro 


Glu 


Asp 


Pro 


Leu 
40 


Pro 


Trp Lys Ala Pro Gly 
45 


Thr 


Val 


Leu 


Ser 


Gin 


Glu 


Glu 


Val 


Glu Gly 


Glu 


Leu Ala Glu Leu Ala Met Gly 




•J V 










55 






60 






Phe 


Leu 


Glv 

***** jr 


Ser 


Arg 


Lys 


Ala 


Pro 


Pro 


Pro Leu Ala Ala Ala 


Leu 


Ala 


£ C 

O J 








70 








75 




80 


His 


Glu 


Ala 


Val 


Ser 


Gin 


Leu 


Leu 


Gin 


Thr Asp Leu Ser Glu Phe Arg 










85 










90 


95 




Lys 


Leu 


Pro 


Arcr 


Glu 


Glu 


ux u. 


V? JL LX 


Glu 


Glu Glu Asp Asp Asp 


Glu 


Glu~ 






100 










105 


110 






Glu 


Lvs 


Ala 


Pro 


Val 


Thr 


Ttf^ii 

J-1CU 




Asp 


Ala Gin Ser Leu Ala 


Gin 


Ser 




115 










120 




125 






Phe 


Phe 


Asn 


Arg 


Leu 


Trp 


Glu 


Val 


Ala 


Gly Gin Trp Gin Lys 


Gin 


Val 














135 






140 




His 


Pro 


Leu 


Ala 


Ala 


Arg 


Ala 


Ser 


Gin 


Arg 


Gin Trp Leu Val Ser 


He 


JL"3t 3 










150 








155 




160 


Ala 


lie 


Arg 


Asn 


Thr 


Arg 


Arg 


Lys 


Met 


Glu Asp Arg His Val 


Ser 


Leu 








165 










170 


175 






Ser 


Phe 


Asn 
180 


Gin 


Leu 


Phe 


Gly 


Leu 
185 


Ser Asp Pro Val Asn 
190 


Arg 


Ala 




Phe 


Ala 


Val 


Phe 


Asp 


Gly His 


Gly 


Gly Val Asp Ala Ala 


Arg 


Tvr 




JL 27 ZJ 










200 




205 






Ala 


Ala 
210 


Val 


His 


Val 


His 


Thr 
215 


Asn 


Ala 


Ala Arg Gin Pro Glu 
220 


Leu 


Pro 


Thr 


Asp 


Pro 


Glu 


Gly 
^ jr 


Ala 


Leu Arg 


Glu 


Ala Phe Arg Arg Thr Asp 


Gin 


a a 








230 








235 




240 


Met 


Phe 


Leu 


Arcr 


LVS 

JT " 


Ala 


Lys 


Arg 


Glu 


Arg Leu Gin Ser Gly Thr 


Thr 








245 










250 


255 




Gly 


Val 


Cys 


Ala 


Leu 


lie 


Ala 


Gly 


Ala 


Thr Leu His Val Ala 


Trp 


Leu 




260 










265 


270 






uiy 


X**DJLy 


Ser 


Gin 


Val 


He 


Leu 


Val 


Gin 


Gin Gly Gin Val Val 


Lys 


Leu 












280 




285 






Met 


Glu 


Pro 


His 


Arcr 


Pro 


Glu Arg 


Gin 


Asp Glu Lys Ala Arg 


He 


Glu 














295 






300 






Ala 


Leu 


Glv 


Gly 


Phe 


Val 


Ser 


His 


Met 


Asp Cys Trp Arg Val Asn Gly 


j u j 








310 








315 




320 


Thr 


Leu 


Ala 


Val 


Ser 


Arcr 


Ala 


He 


Gly 


Asp Val Phe Gin Lys 


Pro 


Tyr 










325 








330 


335 




Val 

V Ct JL 


Ser 


Glv 

vjjLjt 


Glu 


Ala 


Asp 


Ala 


Ala 


Ser 


Arg Ala Leu Thr Gly Ser 


Glu 






340 










345 


350 






Asp 




Leu 


Leu 


Leu 


Ala 


Cys 


Asp 


Gly 


Phe Phe Asp Val Val 


Pro 


His 


■ace 

3 3 3 










360 




365 








Gin 


Val 


Val 


Glv 


Leu 


Val 


Gin 


Ser 


His Leu Thr Arg Gin Gin Gly 




J / u 










375 






380 






Ser 


Gly 


Leu 


Arcr 


Val 


Ala 


Glu 


Glu 


Leu 


Val Ala Ala Ala Arg Glu Arg 


385 






390 








395 




400 


Gly 


Ser 


His 


As P 


Asn 


He 


Thr 


Val 


Met 


Val Val Phe Leu Arg 


Asp 


Pro 






405 










410 


415 




Gin 


Glu 


Leu 


Leu 


Glu 


Gly 


Gly Asn 


Gin 


Gly Glu Gly Asp Pro 


Gin 


Ala 








420 










425 


430 






Glu 


Gly 


Arg 


Arg 


Gin 


Asp 


Leu 


Pro 


Ser 


Ser Leu Pro Glu Pro 


Glu 


Thr 




435 










440 




445 






Gin 


Ala 
450 


Pro 


Pro 


Arg 


Ser 















<210> 9 
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<211> 2786 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 
<222> (0) (0) 

<223> DKFZP566K0524 polynucleotide 
<400> 9 

tggacccaac tggcgaggct gctggggttg cagcgggaca gttggggcgg ccccgcaggc 60 
ccaggttttt gaaaataaag ttaattcaga gaaggtaaaa ctttctcttc ggaatttccc 120 
acataatgat tatgaggatg tttttgaaga gccttcagaa agtggcagtg atcccagcat 180 
gtggacagcc agaggcccct tcagaagaga caggtggagc agtgaggatg aggaggctgc 240 
agggccatca caggctctct cccctctact ttctgatacg cgcaaaattg tttctgaagg 300 
agaactagat cagttggctc agattcggcc attaatattc aattttcatg agcagacagc 360 
catcaaggat tgtttgaaaa tccttgagga aaaaacagca gcgtatgata tcatgcagga 420 
atttatggct ttagaactta agaatctgcc tggtgagttc tactctggga atcaaccaag 4 80 
caacagagaa aaaaacagat accgagatat tcttccatat gattcaacac gcgttcctct 540 
tggaaaaagc aaggactaca tcaatgctag ttatattaga atagtcaatt gtggagaaga 600 
gtatttttat atcgctactc aaggaccact gctgagcacc atagatgact tttggcaaat 660 
ggtgttggaa aataattcaa atgttattgc catgataacc agagagatgg aaggtggaat 720 
tatcaaatgc taccattact ggcccatttc tctgaagaag ccattggaat tgaaacactt 780 
ccgtgtattc ctggagaact accagatact tcaatatttc atcattcgaa tgtttcaagt 840 
tgtggagaag tccacgggaa ctagtcactc tgtaaaacag ttgcagttca ccaagtggcc 900 
agaccatggc actcctgcct cagcagatag cttcataaaa tatattcgtt atgcaaggaa 960 
gagccacctt acaggaccca tggttgttca ctgcagtgcc ggcataggcc ggacaggggt 1020 
gttcctatgt gtggatgtcg tgttctgtgc catcgtaaag gactgttcat tcaacatcat 1080 
ggatatagtg gcccaaatga gagaacaacg ttctggcatg gttcaaacga aggagcagta 1140 
tcacttttgt tacgatattg tgcttgaagt tcttcggaaa cttctgactt tggattaaga 12 00 
aagacttctg ttgcctctca cttgaaatta ccaagtgggt ttgcacctcc tcataaagaa 1260 
catgtttgca ctgtgctgaa gggctttgct atgcatacaa tctgctttct tggtttatca 1320 
gtttattttc tttctaaaag ctccctgaag ggcaatatca tttggcttgg ggtgatcagt 13 80 
gtttacttat tgatcttgct agacaatatc aaaataactt cccacatttt ccagtgaaac 144 0 
agatgttaca taaaacgatt gcagcttggc tatttggttg aagggattac agagcccaat 1500 
aaaggattta aaatatattc attaagattt tatttggaaa ggtggctgga gagagctgag 1560 
gatttccagg actttgtaag ttcttattct gggagaacat aaggccaata atcatgacct 1620 
cttccaggca tttttaagac agatgtctat tcatgttctt tagctagagc ctgtactttt 1680 
tgctggcatt tgaataaccc agtttaaaaa gagtccagtt agggtggact aactttggac 1740 
acaaattggc ttccatttcc tacattttca tactgctgcc ttcctacagc tgctagacca 1800 
agacctgttg gtctgggaag catttcatgg atagggagag ctcctctcgg tgaacagtcc 1860 
aaaactaaaa tagatgttta tatagaaagc ccaagaggag atttttgcca tgcctgagtt 1920 
ctttcctatc ccaccctaac acttaacata ttacttagtc tgctttgtta aaagcaagta 1980 
ttacctttaa cttgcctctt actctttgcc ctttagctaa ctaataaagt ttgatatggg 2040 
cattattata taattctgag tcattcatgg tatctctcat gtttgatgta tttttcaaac 210 0 
taagatctat gatagttttt ttttccagag ttccatcaaa tcatttattt cctttacttt 2160 
ctcacctctg ttgaaacatt tagaaactgg atttgggaac ccaattttgg aaaaccagat 2220 
tcatagtcat gaaaatggaa acttccatat tctgtttttg aaaagatgtg gccattatta 2280 
cagtaatttt attataggac tttgcctcgt acaattaata gtgatatttt ggacaaggag 2340 
ttctggtgac aagctatacc taattataag ctataaaaca atagatatga gtgtttgtac 2400 
agtttaactc aatggagatc agaatattct atgtattgag aaaatgttta atatcaatct 2460 
ataaatcttg aatttctaag aggcttattt tgttcttttg gctgaatgag tatatttgaa 2520 
ttggttgaat aattaataat tctcattgta aaaataatta tatgccaaaa atatatttga 2580 
tgttaaatca aatagatgat tctgtttaca ttgttcatat gaataataat ctgtgttaat 2640 
ttcattttga taattggcct ttaatatttg tatctctaat tttattttct ctctgttact 2700 
gtaaaataat agctataatg tataacaatt ttcttcagaa gaattctatg ctattattaa 2760 
aataaaatat ttactgaaaa aaaaaa 2786 
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<210> 10 

<211> 398 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> (0) . . . (0) 

<223> DKFZP566K0524 polypeptide 



<400> 10 



Gly 


Pro 


Asn 


Trp 


Arg 


va±y 


Cys 


Trp Gly Cys 


Ser 


Gly Thr 


Val Glv Ala 


1 








c 

b 








10 








15 


Ala 


Pro 


Gin 


Ala 


Lj±n 


vax 


Phe 


Glu Asn 


Lys 


Val 


Asn 


Ser 


V7JLIA XJjr 0 Val 








A U 








25 










30 


Lys 


Leu 


Ser 


Leu 


Arg 


Asn 


Phe 


Pro His 


Asn Asp 


Tyr Glu 






35 










40 








45 




Glu 


Glu 


Pro 


Ser 


Glu 


Ser 


Gly Ser Asp 


Pro 


Ser 


Met 


Trp 


inr Ala iiiy 




50 










55 








60 






Gly 


Pro 


Phe 


Arg 


Arg 


Asp 


Arg Trp Ser 


Ser 


Glu 


Asp 


Glu 


VjXU Ala Ala 


65 










70 








75 








Gly 


Pro 


Ser 


Gin 


Ala 


Leu 


Ser 


Pro Leu 


Leu 


Ser 


Asp 


Thr 


Aiy layo lie 








85 








90 










Val 


Ser 


Glu 


Gly 


Glu 


Leu 


Asp 


Gin Leu 


Ala 


Gin 


He 


Arg 


JrXVO JLicU 11c 








100 








105 










110 

XXV 


Phe 


Asn 


Phe 


His 


Glu 


Gin 


Thr 


Ala He 


Lys Asp 


Cys 


Leu 


jjys lie iieu. 






115 










120 








125 




Glu 


Glu 


Lys 


Thr 


Ala 


Ala 


Tyr Asp He 


Met 


Gin 


Glu 


Phe 


Met aia jueu 




130 










135 








140 






Glu 


Leu Lys 


Asn 


Leu 


Pro 


Gly Glu Phe Tyr 


Ser 


Gly Asn 


bin jrro oer 


145 










150 








155 






1DU 


Asn 


Arg 


Glu 


Lys 


Asn Arg 


Tyr Arg Asp 


He 


Leu 


Pro 


Tyr 


Asp oer inr 








165 








170 








1/3 


Arg 


Val 


Pro 


Leu 


Gly Lys 


Ser 


Lys Asp 


Tyr 


He 


Asn 


Ala 


oer iyr lie 






180 








185 










i on 


Arg 


He 


Val 


Asn 


Cys 


Gly 


Glu Glu Tyr 


Phe 


Tyr 


He 


Ala 


inr bin v^iy 




195 










200 








205 




Pro 


Leu 


Leu 


Ser 


Thr 


He 


Asp Asp Phe 


Trp 


Gin 


Met 


Val 


T.oii filii JVori 
JUCU VjIU ribll 




210 










215 








220 






Asn 


Ser 


Asn 


Val 


He 


Ala 


Met 


He Thr Arg 


Glu 


Met 


Glu 


m-v filv Tie 
uiy vjxy xxc 


225 










230 








235 






240 


lie 


Lys 


Cys 


Tyr 


His 


Tyr 


Trp 


Pro He 


Ser 


Leu 


Lys 


Lys 


Pro Leu Glu 










245 








250 








255 


Leu 


Lys 


His 


Phe 


Arg 


Val 


Phe 


Leu Glu 


Asn 


Tyr 


Gin 


He 


Leu Gin Tyr 






260 








265 










270 


Phe 


He 


He 


Arg 


Met 


Phe 


Gin 


Val Val 


Glu 


Lys 


Ser 


Thr 


Gly Thr Ser 






275 










280 








285 




His 


Ser 


Val 


Lys 


Gin 


Leu 


Gin 


Phe Thr 


Lys 


Trp 


Pro Asp 


His Gly Thr 




290 










295 








300 






Pro 


Ala 


Ser 


Ala 


Asp 


Ser 


Phe 


He Lys 


Tyr 


He 


Arg 


Tyr 


Ala Arg Lys 


305 










310 








315 






320 


Ser 


His 


Leu 


Thr 


Gly 


Pro 


Met 


Val Val 


His 


Cys 


Ser 


Ala 


Gly He Gly 










325 








330 








335 


Arg 


Thr Gly 


Val 


Phe 


Leu 


Cys Val Asp Val Val 


Phe 


Cys 


Ala He Val 








340 








345 










350 


Lys 


Asp 


Cys 


Ser 


Phe 


Asn 


He 


Met Asp 


He 


Val 


Ala 


Gin 


Met Arg Glu 




355 










360 








365 
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Gin Arg Ser Gly Met Val Gin Thr Lys Glu Gin Tyr His Phe Cys Tyr 

370 375 380 

Asp lie Val Leu Glu Val Leu Arg Lys Leu Leu Thr Leu Asp 
385 390 395 



<210> 11 
<211> 2226 
<212> DNA . — 
<213> Homo sapiens 



<220> 

<22l> misc_f eature 
<222> (0)...(0) 

<223> FLiJ20313 nucleotide sequence 



<400> 11 

ctcctctgcg cttccgtgga gcctccaggc 
gctgcgcgcg tctccctgcc cacagcagcg 
cgccttctcg ggagccccca ctggctcggc 
aaccgaagat taaaaaactg gagccagtcc 
tcaattttgt gagaaaatgc attgcaacag 
tgatatgcag taacttcaaa atctccttta 
tccattacag aaaccttctt cttggtgaac 
ttgtcacagt aaacgaccac aagaggaagc 
aatttaatcc aacagagtta attatttatt 
ttgatgaatc aggtcccgaa agtgctaaaa 
agccaacaga cctccagcta ctctttgcat 
cagcaaacaa aattaatgga attccctcag 
atggagctgg tggtggcagc agccagaaaa 
acagagaaat caagaggaea ggtgcttccg 
acatgatatc cacttgcctt ccagaataca 
atctaaagat cttttcccat tcttttgttg 
actctaacgg cagtgctctt gtgcgaatgg 
agattgacca gaggatttgt aatgcaataa 
acaaatcaga tttggataag accttgccta 
aactgaagca gctatgcgtt aatgagcctt 
cactggaaaa tactcgatgg ttagaatatg 
ttgtatacat gctagaaagc aaacatctct 
acttgagctg ttgtgtagct tctcttgttc 
ttactggatt tcagagtctg atacagaagg 
acagatgcaa ccatctaaag agatcagaga 
atgccacctg gcagctgtta gaacaatatc 
tggcagtgtt gtatgacagc acccggatct 
ctcaccagcg agtgaagcaa agcacggtca 
attttccttc acgagtttga agtttctggt 
taagcagggt ctcaagccaa acttgaaacc 
tcgaaccaac aacagaagct gacaaatgcc 
cagctcttgt ggaccttcag gaagcctttc 
cggggtcttc agcctcagcc gtaaggccct 
aacctgagtt ctggccctgg tctgggaaaa 
gcctgagagt ccaggttgct aacctgtaaa 
tagtctcatt gtatgaggtg taacattttg 
aaagacatat gaattattct ggatttgttg 
aaaagt 
<210> 12 
<211> 451 



cgacccccgg gaactggagg accccaggag 60 
cggctgcctg attcccggcg ccgcgaaatg 120 
gaaaacttac tgacgataag atcaattcgg 180 
ttttgccagg agaaattgtc gtaaatgaag 240 
acacaagcca gtacgatttg tggggaaagc 300 
ttacagatga cccaatgcca ttacagaaat 360 
acgatgtccc tttaacatgt attgagcaaa 420 
agaaagtcct aggccccaac cagaaactga 480 
gtaaagattt cagaattgtc agatttcgct 540 
aggtatgcct tgcaatagct cattattccc 600 
ttgaatatgt tgggaaaaaa taccacaatt 660 
gagatggagg aggaggagga ggaggaggta 720 
ctccactctt tgaaacttac tcggattggg 780 
ggtggagagt ttgttctatt aacgagggtt 840 
ttgtagtgcc aagttcttta gcagaccaag 900 
ggagaaggat gccactctgg tgctggagcc 960 
ccctcatcaa agacgtgctg cagcagagga 1020 
ctaaaagtca cccacagaga agtgatgttt 1080 
atattcaaga agtacaagca gcatttgtaa 1140 
ttgaagaaac tgaagagaaa tggttatctt 1200 
taagggcatt ccttaagcat tcagcagaac 1260 
ctgtagtcct acaagaggag gaaggaagag 132 0 
aagtgatgct ggatccctat tttaggacaa 1380 
agtgggtcat ggcaggatat cagtttctag 1440 
aagagtctcc tttatttttg ctattcttgg 1500 
ctgcagcttt tgagttctcc gaaacctacc 1560 
cactgtttgg caccttcctg ttcaactccc 1620 
gtaggataaa aagttgtaca aaacaagatt 1680 
cacaattcat tgatgtagag gatttatgac 1740 
attctgaacc aaagtgccat ttcacccacc 1800 
gtggagacca ttgagggaaa cagaaagggg 1860 
taggaagagg attgccctca tagtgagctc 1920 
999ctaggca gtgtgaccta gggagcggga 1980 
gtgctaggcc catgttccac tcaggcttca 2040 
atggatctgt caaactaaca cttatgcctt 2100 
taaactgtga atcattatgc aaattttcct 2160 
gtataaaaga caaaatacac tggtcaaaaa 2220 

2226 
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• 
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<212> PRT 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> (0) . . . (0) 

<223> FLJ20313 polypeptide sequence 

<400> 12 — 

Met Pro Leu Gin Lys Phe His Tyr Arg Asn Leu Leu Leu Gly Glu His 

15 10 15 

Asp Val Pro Leu Thr Cys lie Glu Gin He Val Thr Val Asn Asp His 

20 25 30 

Lys Arg Lys Gin Lys Val Leu Gly Pro Asn Gin Lys Leu Lys Phe Asn 

35 40 45 

Pro Thr Glu Leu He He Tyr Cys Lys Asp Phe Arg He Val Arg Phe 

50 55 60 

Arg Phe Asp Glu Ser Gly Pro Glu Ser Ala Lys Lys Val Cys Leu Ala 
65 70 75 80 

He Ala His Tyr Ser Gin Pro Thr Asp Leu Gin Leu Leu Phe Ala Phe 

85 90 95 

Glu Tyr Val Gly Lys Lys Tyr His Asn Ser Ala Asn Lys He Asn Gly 

100 105 HO 

He Pro Ser Gly Asp Gly Gly Gly Gly Gly Gly Gly Gly Asn Gly Ala 

115 120 125 

Gly Gly Gly Ser Ser Gin Lys Thr Pro Leu Phe Glu Thr Tyr Ser Asp 

130 135 140 

Trp Asp Arg Glu He Lys Arg Thr Gly Ala Ser Gly Trp Arg Val Cys 
145 150 155 160 

Ser He Asn Glu Gly Tyr Met lie Ser Thr Cys Leu Pro Glu Tyr He 

165 170 175 

Val Val Pro Ser Ser Leu Ala Asp Gin Asp Leu Lys He Phe Ser His 

180 185 190 

Ser Phe Val Gly Arg Arg Met Pro Leu Trp Cys Trp Ser His Ser Asn 

195 200 205 

Gly Ser Ala Leu Val Arg Met ala Leu He Lys Asp Val Leu Gin Gin 

210 215 220 

Arg Lys He Asp Gin Arg He Cys Asn Ala He Thr Lys Ser His Pro 
225 230 235 240 

Gin Arg Ser Asp Val Tyr Lys Ser Asp Leu Asp Lys Thr Leu Pro Asn 

245 " 250 255 

He Gin Glu Val Gin Ala Ala Phe Val Lys Leu Lys Gin Leu Cys Val 

260 265 270 

Asn Glu Pro Phe Glu Glu Thr Glu Glu Lys Trp Leu Ser Ser Leu Glu 

275 280 285 

Asn Thr Arg Trp Leu Glu Tyr Val Arg Ala Phe Leu Lys His Ser Ala 

290 " 295 300 

Glu Leu Val Tyr Met Leu Glu Ser Lys His Leu Ser Val Val Leu Gin 
305 310 315 320 

Glu Glu Glu Gly Arg Asp Leu Ser Cys Cys Val Ala Ser Leu Val Gin 

325 330 335 

Val Met Leu Asp Pro Tyr Phe Arg Thr He Thr Gly Phe Gin Ser Leu 

340 345 350 

He Gin Lys Glu Trp Val Met ala Gly Tyr Gin Phe Leu Asp Arg Cys 

355 360 365 

Asn His Leu Lys Arg Ser Glu Lys Glu Ser Pro Leu Phe Leu Leu Phe 
370 375 380 
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Leu Asp Ala Thr Trp Gin Leu Leu 
385 390 
Phe Ser Glu Thr Tyr Leu Ala Val 
405 

Leu Phe Gly Thr Phe Leu Phe Asn 
420 

Ser Thr Val Ser Arg lie Lys Ser 
435 440 
Ser Arg Val 
450 



PCT/CA03/00393 



Glu Gin Tyr Pro Ala Ala Phe Glu 
395 400 
Leu Tyr Asp Ser Thr Arg He Ser 

410 415 
Ser Pro His Gin Arg Val Lys Gin 
425 430 
Cys Thr Lys Gin Asp Tyr Phe Pro 
445 
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